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^ (57) Abstract: The invention relates to 2-0 sulfatase and uses thereof. In particular, the invention relates to recombinantly produced 
O 2-0 sulfatase. functional variants and nucleic acid molecules that encode these molecules. The invention also provides methods of 
O using 2-0 sulfatase for a variety of purposes, including degrading and analyzing glycosaminogl yeans (GAGS) present in a sam- 

pie. For instance. 2-0 sulfatase may be used for determining the purity, identity, composition and sequence of glycosaminoglycans 
Q present in a sample. The invention also relates to methods of inhibiting angiogenesis and cellular proliferation as well as methods 
1^ for treating cancer, neurodegenaative disease, atherosclerosis and microbial infection using 2-0 sulfatase and/or GAG fragments 

produced by degradation with 2-0 sulfatase. 
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2-0 SULFATASE COMPOSITIONS AND RELATED METHODS 



FIELD OF THE INVENTION 

The invention relates to 2-0 sulfatase, related compositions, and methods of use 

5 thereof. 



BACKGROUND OF THE INVENTION 

Sulfated glycosatninoglycans such as heparin and the related heparan sulfate 
(HSGAGs) are complex, linear carbohydrates possessing considerable chemical 

10 heterogeneity (Esko, J. D., and Lindahl, U. (2001) J Clin Invest 108(2), 169-73, Shriver, Z., 
Liu, D., and Sasisekharan, R. (2002) Trends Cardiovasc Med 12(2), 71-72). Their structural 
diversity is largely a consequence of $he variable number and position of sulfates present 
within a single HSGAG chain. Because of their highly anionic character, these 
polysaccharides historically have been relegated to an exclusively structural role, namely as a 

15 sort of hydration gel and scaffold comprising the extracellular matrix (ECM). Contrary to 
this limited perception, however, HSGAGs actually play an important and dynamic function 
in many critical biological processes ranging from development (Perrimon, N., and Bemfield, 
M. (2000) Nature 404(6779), 725^8) and tissue repair (Simeon, A., Wegrowski, Y., 
Bontemps, Y., and Maquart, F. X. (2000) J Invest Dermatol 1 15(6), 962-8) to apoptosis 

20 (Ishikawa, Y., and Kitamura, M. (1999) Kidney bit 56(3), 954-63, Kapila, Y. L., Wang, S., 
Dazdn, P., TafoUa, E., and Mass, M. J. (2002) J Biol Chem 277(10), 8482-91). These 
polysaccharides are also central players in several pathological conditions such as cancer 
(Selva, E. M., and Perrimon, N. (2001) Adv Cancer Res 83, 67-80, Sasisekharan, R., Shriver, 
Z., Venkataraman, G., and Narayanasami, U. (2002) Nat Rev Cancer 2(7), 521-8), 

25 angiogenesis (Folkman, J., and Shing, Y. (1992) Adv Exp Med Biol 313, 355-64, Vlodavsky, 
L, Elkin, M., Pappo, O., Aingom, H., Atzmon, IL, Ishai-MichaeU, R., Aviv, A., Pecker, L, 
and Friedmann, Y. (2000) Isr Med Assoc J 2 Suppl 37-45), certain neurodegenerative 
diseases such as Alzheimers (Cohlberg, J. A., li, J., Uversky, V. N., and Fink, A. L. (2002) 
Biochemistry 41(5), 1502-1 1), athleroscelerosis (Sehayek, E., Olivecrona, T., Bengtsson- 

30 Olivecrona, G., Vlodavsky, I., Levkovitz, H., Avner, R., and Eisenberg, S. (1995) 

Atherosclerosis 1 14(1), 1-8), and microbial infectivity (Liu, J., and Thoip, S. C. (2002) Med 
Res Rev 22(1), 1-25). HSGAGs do so as part of proteoglycans found at the cell surface and 



-^^WO 1004/062592 " *^ _ . PCT/US2004/000332 , 

-2- 

within the ECM where they mediate signaling pathways and cell-cell communication by 
modulating the bioavailability and temporal-spatial distribution of growth factors, cytokines, 
and morphogens (Tumova, S., Woods, A., and Couchman, J. R. (2000) IntJBiochem Cell 
Biol 32(3), 269-88) in addition to various Teceptors and extracellular adhesion molecules 
5 (Lyon, M., and GaUagher, J. T. (1998) Matrix Biol 17(7), 485-93). HSGAG structure and 
function are inextricably related. 

A study of the HSGAG structure-function paradigm (Gallagher, J. T. (1997) Biochem 
Sac Trans 25(4), 1206-9) requires the ability to deteraiine both the overall composition of 
biologically relevant HSGAGs as well as ultimately ascertaining their actual linear sequence 

10 (fine structure). Therefore the availability of several chemical and enzymatic reagents which 
are able to cleave HSGAGs in a structure-specific fashion have proven to be valuable. One 
example of an important class of GAG degrading enzymes is the heparin lyases (heparinases) 
originally isolated from the gram negative soil bacterium R heparinum (Ernst, S., Laager, R., 
Cooney, C. L., and Sasisekharan, R. (1995) Crit Rev Biochem MolBiol 30(5), 387-444). 

15 Each of the three heparinases encoded by this microorganism cleave both heparin and 

heparan sulfate Avith a substrate specificity that is generally based on the differential sulfation 
pattern which exists within each GAG chain (Ernst, S., Langer, R., Cooney, C. L., and 
Sasisekharan, R. (1995) Crit Rev Biochem Mol Biol 30(5), 387-444, Rhomberg, A. J., Ernst, 
S., Sasisekharan, R, andBiemann, K. (1998) Proc Natl Acad Sci USA 95(8), 4176-81). In 

20 fact, F. heparinum uses several additional enzymes m an apparently sequential manner to 
first depolymerize and then subsequently desulfate heparin/heparan sulfate. In addition to 
heparinase I (Sasisekharan, R., Buhner, M., Moremen, K. W., Cooney, C. L., and Langer, R. 
(1993) Proc Natl Acad Sci U SA 90(8), 3660-4), we have recently cloned one of these 
enzymes, the A 4,5 unsaturated glycuronidase (Myette, J. R., Shriver, Z., Kiziltepe, T., 

25 McLean, M. W., Venkataraman, G., and Sasisekharan, R. (2002) Biochemistry 41(23), 7424- 
7434). This enzyme has been recombinaaatly expressed in E, coli as a highly active enzyme. 
Because of its rather unique substrate specificity (Wamick, C. T., and Linker, A. (1972) 
Biochemistry 1 1(4), 568-72), this enzyme has already proven to be a usefixl addition to our 
PEN-MALDI based carbohydrate sequencmg methodology (Venkataraman, G., Shriver, Z., 

30 Raman, R., and Sasisekharan, R. (1999) Science 286(5439), 537-42). 
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SUMMARY OF THF. TNVENTION 

2-0 sulfatase has been cloned jBrom the R heparinum genome and its subsequent 
recombinant expression in E, coli as a soluble, highly active enzyme has been accomplished. 
Thus in one aspect the invention provides for a recombinantly produced 2-0 sulfatase. 
5 Recombinant expression may be accompUshed in one embodiment with an expression vector. 
An expression vector may be a nucleic acid for SEQ ID N0:1, optionally operably linked to a 
promoter. In another embodiment the expression vector may be a nucleic acid for SEQ ID 
NO: 3 or a variant thereof also optionally linked to a promoter. In one embodiment the 
recombinantly expressed 2-0 sulfatase is produced using a host cell comprismg the 
10 expression vector. In another embodiment the expression vector may comprise any of the 
isolated nucleic acid molecules provided herein. In some embodiments the protein yields 
using the recombinantly expressed 2-0 sulfatases provided herein exceed 100 mg of sulfatase 
enzyme per liter of induced bacterial cultures. In other embodiments the protein yield is 1 10, 
115, 120, 125, 130, 150, 175, 200 mg or more per hter of induced bacterial culture. In other 
15 aspects methods of achieving such protein yields are provided comprising recombinantly 
expressing 2-0 sulfatase and using at least one chromatographic step. 

In another aspect of the invention isolated nucleic acid molecules are provided. The 
nucleic acid molecules may be (a) nucleic acid molecules which hybridize under stringent 
conditions to a nucleic acid molecule having a nucleotide sequence set forth as SEQ ID NO: 
20 1 or SEQ ID NO: 3, and which code for a 2-0 sulfatase, (b) nucleic acid molecules that differ 
from the nucleic acid molecules of (a) in codon sequence due to degeneracy of the genetic 
code, or (c) complements of (a) or (b). Li one embodiment the isolated nucleic acid molecule 
comprises the nucleotide sequence set forth as SEQ ID NO: 1. In another embodiment the 
isolated nucleic acid molecule comprises the nucleotide sequence set forth as SEQ ID NO: 3. 
25 In still other embodiments the isolated nucleic acid molecule codes for SEQ ID NO: 2, and in 
yet other embodiments the isolated nucleic acid molecule codes for SEQ ID NO: 4. 

The isolated nucleic acid molecules of the invention are also intended to encompass 
homologs and alleles, hi one aspect of the invention, the isolated nucleic acid molecules are 
at least about 90% identical to the nucleotide sequence set forth as SEQ ID NO: 1 or 3. In 
30 other embodiments, isolated nucleic acid molecules that are at least 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 1 or 3 are given. In still other 
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embodiments the isolated nucleic acid molecules are at least 99.5% or 99.9% identical to the 
nucleotide sequence set forth as SEQ JD NO: 1 or 3. 

Therefore, in one aspect of the invention a 2-0 sulfatase molecule produced by 
expressing the nucleic acid molecules provided herein is given. In some embodiments, as 
described above, the nucleic acid molecule is expressed recombinantly. In one embodiment 
the recombinant expression is carried out in E. colL 

In another aspect the 2-0 sulfatase of the invention is a polypeptide having an amino 
acid sequence of SEQ ID NO: 2, or a functional variant thereof In yet another aspect the 
polypeptide has an amino acid sequence of SEQ ID NO: 4, or a functional variant thereof. In 
still another aspect of the invention the 2-0 sulfatase is an isolated 2-0 sulfatase. In yet 
another embodiment the isolated 2-0 sulfatase is synthetic. In yet another aspect of the 
invention an isolated polypeptide which comprises a 2-0 sulfatase is also provided. The 
isolated polypeptide in some embodiments comprises a 2-0 sulfatase having an amino acid 
sequence set forth as SEQ ID NO: 2. In other embodiments, the isolated polypeptide 
comprises a 2-0 sulfatase v^hich has the amino acid sequence as set forth as SEQ ID NO: 4. 
In still other embodiments the isolated polypeptide comprises a 2-0 sulfatase which has the 
amino acid sequence as set forth as SEQ ID NO: 2 or 4 or functional variants thereof 
In one aspect of the invention, therefore, 2-0 sulfatase functional variants are 
provided. In one embodiment the 2-0 sulfatase fimctional variants include 2-0 sulfatases 
that contain at least one amino acid substitution. In another embodiment the 2-0 sulfatase 
fimctional variants contain 1, 2, 3, 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 
30, 40 or more amino acid substitutions. In some of these embodiments the 2-0 sulfatase 
functional variants are 2-0 sulfatases that function similarly to the native 2-0 sulfatase. In 
other embodiments the 2-0 sxilfatase functional variants are 2-0 sulfatases that function 
differently than the native 2-0 sulfatase. The different function can be, for instance, altered 
enzymatic activity or different substrate affinity. For example, as described herein, there are 
specific active site amino acids that are positioned to interact with specific constituents of 
glycosaminoglycans (e.g., Lys 175, Lys 238 with the planar carboxyl group of the uronic 
acid; Lys 107 and possibly Thr 104 vn&x the 6-0 sulfate of the glucosamine; and Lys 134, 
Lys 308 with the 2-0 stilfate). Therefore, 2-0 sulfatase functional variants can maintain 
these residues or contain amino acid substitutions at these residues to maintain or alter, 
respectively, the enzyme's function on a specific substrate. In yet other embodiments the 
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amino acid substitutions occur outside of the active and binding sites as described herein. In 
still other embodiments the active and binding sites are targeted for substitution, hi some of 
the foregoing embodiments the amino acid substitutions occur outside of the catalytic domain 
given in SEQ ID NO: 6. In other embodiments the amino acid substitutions occur within this 
catalytic domain. In still other embodiments the choice of amino acid substitutions can be 
based on the residues that are found to be conserved between the various sulfatase enzymes 
(e.g., see the sequence alignments provided in Figs. 3, 9 and 16) (e.g., highly conserved His 
136, His 191, Asp 42, Asp 63, Asp 295). Amino acid substitutions can be conservative or 
non-conservative. 

In one aspect of the invention the amino acid sequence of the isolated polypeptide 
contains (a) at least one residue selected from Arg 86, Asp 42, Asp 159, Asp 295, Cys 82, 
FGly 82, Ghi 43, Ghi 237, Glu 106, Ghi 309, His 136, His 296, Leu 390, Leu 391, Leu 392, 
Lys 107, Lys 134, Lys 175, Lys 238, Lys 308 or Thr 104 and (b) at least one amino acid 
substitution. In one embodiment of the invention the amino acid sequence of the isolated 
polypeptide contains a Cys 82 residue and at least one amino acid substitution. In another 
embodiment the isolated polypeptide contains a Cys 82 residue which is subsequently 
modified to formyl glycine and at least one amino acid substitution. Li still other 
embodiments the isolated polypeptide contains a FGly 82 residue and at least one amino acid 
substitution. 

In another aspect of the invention fimctional variants include a 2-0 sulfatase which 
contains at least one amino acid residue that has been substituted with a different amino acid 
than in native 2-0 sulfatase and wherein the residue that has been substituted is selected from 
Arg 86, Asp 42, Asp 159, Asp 295, Ghi 43, Gha 237, Glu 106, Ghi 309, His 136, His 296, 
Leu 390, Leu 391, Leu 392, Lys 107, Lys 134, Lys 175, Lys 238, Lys 308 and Thr 104. 

In another aspect, the invention is a composition comprising, an isolated 2-0 sulfatase 
having a higher specific activity than native 2-0 sulfatase. In some embodiments, the 2-0 
sulfatase has a specific activity that is at least about 5- fold higher than native 2-0 sulfatase. 
The specific activity of the 2-0 sulfatase in other embodiments maybe 6-, 7-, 8-, 9-, 10-, 11-, 
12-, 13-, 14-, 15-, 16-, 17-, 18-, or 19-fold higher than the specific activity of the native 
enzyme. In other embodiments the specific activity may be about 20-, 25-, 30-, 40- or 50- 
fold higher. In one embodiment the 2-0 sulfatase has a specific activity that is about ten fold 
higher than the specific activity of the native enzyme. 
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In another aspect the invention also provides a method of degrading a 
glycosaminoglycan. The method may be performed by contacting the glycosaminoglycan 
with a 2-0 sulfatase of the invention in an effective amount to degrade the 
glycosaminoglycan. In other embodiments the method may be performed by contacting the 
glycosaminoglycan with at least one other glycosaminoglycan degrading enzyme, hi some 
embodiments the at least one other glycosaminoglycan degrading enzyme is heparinase or 
glycnronidase. In other embodiments the glycosaminoglycan is contacted with the at least 
one other glycosaminoglycan degrading enzyme concomitantly with the 2-0 sulfatase. In 
still other embodiments the glycosaminoglycan is contacted with the at least one other 
glycosaminoglycan degrading enzyme prior to or subsequent to contacting the 
glycosaminoglycan with 2-0 sulfatase. In still another embodiment the glycosaminoglycan is 
contacted with a heparinase prior to contact with a 2-0 sulfatase. 

In some embodiments the glycosaminoglycan is a long chain saccharide. In such 
embodiments the glycosaminoglycan is a tetrasaccharide or a decasaccharide. In other 
embodiments the glycosaminoglycan contains a 2-0 sulfated uronic acid at the non-reducing 
end. In stiU other embodunents the glycosammoglycancontaiiis a |Jl-^4 linkage. In yet 
another embodiment the glycosaminoglycan is a chondroitin sulfate. In other embodiments 
the glycosaminoglycan is a highly sulfated glycosanoinoglycan. In such embodiments the 
highly sulfated glycosaminoglycan contains a 6-0 sulfated glucosamine. In yet other 
embodiments the highly sulfated glycosaminoglycan contains a glucosamine sulfated at the 
N-position. 

In some aspects of the invention degraded glycosaminoglycans prepared by the 
methods described herein are provided. In still other aspects of the invention a composition 
which contains a degraded glycosaminoglycan is given. In still another aspect of the 
invention the composition is a pharmaceutical preparation which also contains a 
pharmaceutically acceptable carrier. 

The present invention also provides methods for the analysis of a glycosaminoglycan 
or group of glycosaminoglycans. In one aspect the invention is a method of analyzing a 
glycosaminoglycan by contacting a glycosaminoglycan with the 2-0 sulfatase of the 
invention in an effective amount to analyze the glycosaminoglycan. 
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The present invention also provides 2-0 sulfatase immobilized on a solid support. In 
another embodiment at least one other glycosaminoglycan degrading enzyme is also 
immobilized on the solid support. 

In one aspect of the invention a method for identifying the presence of a particular 
5 glycosaminoglycan in a sample is provided. In another aspect of the invention a method for 
determining the identity of a glycosaminoglycan in a sample is provided. In yet another 
aspect of the invention a mefliod for determining the purity of a glycosaminoglycan in a 
sample is also provided. In still a further aspect of the invention a method for determining 
the composition of a glycosaminoglycan in a sample is provided. Yet another aspect of the 
10 invention is a method for determining the sequence of saccharide units in a 

glycosaminoglycan. In some embodiments, these methods can further comprise an additional 
analytical technique such as mass spectrometry, gel electrophoresis, capillary electrophoresis 
orHPLC. 

In another aspect the invention is a method of inhibiting angiogenesis by 
15 administering to a subject in need thereof an effective amount of any of the pharmaceutical 
preparations described herein for inhibiting angiogenesis. 

In another aspect a method of treating cancer by administering to a subject in need 
thereof an effective amount of any of the pharmaceutical preparations described herein for 
treating cancer is also provided, 
20 Yet another aspect of the invention is a method of inhibiting cellular proliferation by 

administering to a subject in need thereof an effective amount of any of the pharmaceutical 
preparations described herein for inhibiting cellular proliferation. 

In yet another aspect of the invention a method of treating neurodegenerative disease 
by achninistering to a subject in need thereof an effective amount of any of the 
25 pharmaceutical preparations described herein for treating neurodegenerative disease is 
provided. In one embodiment the neurodegenerative disease is Alzheimer's disease. 

Another aspect of the invention is a method of treating atherosclerosis by 
ad m i n istering to a subject m need thereof an effective amount of any of the pharmaceutical 
preparations described herein for treating atherosclerosis. 
30 Jn another aspect of the invention a method of treating or preventing micrebial 

infection by administering to a subject in need thereof an effective amount of any of the 
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phannaceutical preparations described herein for treating or preventing microbial infection is 
given. 

In yet another aspect of the invention a method of controlling apoptosis by 
administering to a subject in need thereof an effective amount of any of the pharmaceutical 
5 preparations described herein for controlling apoptosis is provided. 

In other aspects of the invention methods of repairing tissue or controlling 
development are also provided. 

In some embodiments of the methods of the invention the 2-0 sidfatase is used 
concurrently with, prior to or following treatment with at least one other glycosaminoglycan 
10 degrading enzyme. In some embodiments the at least one other glycosaminoglycan 

degrading enzyme is heparinase or glycuronidase. In some embodiments of the compositions 
or pharmacetical preparations of the invention other enzymes such as heparinase and/or 
glycmronidase may be included. 

In other aspects of the invention, compositions, pharmaceutical preparations and 
15 therapeutic methods are provided with/using the 2-0 sulfatase or the degraded 
glycosamiaoglycans alone or in combination. 

Compositions of any of the 2-0 sulfatases, degraded glycosaminoglycans, nucleic 
acids, polypeptides, host cells or vectors described herein are also encompassed in the 
invention. Pharmaceutical preparations of any composition provided herein are also provided 
20 in some embodiments. In these embodiments the pharmaceutical preparations contain a 
pharmaceutically acceptable carrier. 

In still another aspect of the invention, a substantially pure, non-recombinantly 
produced 2-0 sulfatase that has a purity that is about 3000-fold greater than cmde bacterial 
lysate is provided. In some embodiments the purity of the substantially pure, non- 
25 recombiDantly produced 2-0 sulfatase is about 4000-, 5000-, 6000-, 7000-, 8000-, 9000- or 
10,000-fold more pure than crude bacterial lysate. In some embodiments the substantially 
pure, non-recombinantly produced 2-0 sulfatase is obtained by a multi-step fractionation 
method. In one embodiment the method is a five-step fractionation method. In this aspect of 
the invention, the term "substantially pure" means that the proteins are essentially free of 
30 other substances to an extent practical and appropriate for their intended use. 
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Each of the limitations of the invention can encompass various embodiments of the 
invention. It is, therefore, anticipated that each of the limitations of the invention involving 
any one element or combinations of elements can be included in each aspect of the invention. 

These and other aspects of the invention, as well as various advantages and utilities, 
will be more apparent with reference to tbe detailed description of the preferred 
embodiments. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 provides the results of Flavobacteriimi 2-0 sulfatase purijScation and 
proteolysis. Panel (A) provides tiie final RP-HPLC chromatography of blue-Sepharose CL- 
6B purified sulfatase. Panel (B) illustrates the C4 RP-HPLC chromatographic resolution of 
sulfatase peptides generated by a limit trypsin digestion of the major peak shown in 
Panel (A). 

Fig* 2 provides the F. heparinum 2-0 sulfatase coding sequence (open reading frame 
from genomic clone S4A. The nucleic acid and amino acid sequence (SEQ ID NOs: 1 and 2, 
respectively) of the fiill length gene for the 2-0 sulfatase begins with the first methionine (the 
nucleic acid and amino acid sequences including the sequence upstream of the first 
methionine are provided as SEQ ID NOs: 38 and 39, respectively). The nucleic acid and 
amino acid sequence of the truncated 2-0 sulfatase which lacks the first 24 amino acids 
(herein referred to as 2-0 AN^"^"*) of the fiill length gene are given as SEQ ID NOs: 3 and 4, 
respectively. Translation initiation and termination codons are shown in bold. Primers used 
in original PGR screen are noted by horizontal arrows. Internal Nde 1 site is double 
underscored. Corresponding amino acid sequence of select sulfatase peptides are boxed. 
Sulfatase consensus sequence CXPXRXXXXS/TG (SEQ ID NO: 5) is boxed and shaded 
with active site cysteine at position 82 noted by an asterisk. Putative signal sequence is 
overscored with predicted peptidase cleavage site represented by a vertical arrow. 

Fig. 3 depicts a 2-0 sulfatase miiltiple sequence alignment The flavobacterial 
enzyme is a member of a large sulfatase family. Alignment shown excludes 2-0 sulfatase 
carboxy tCTninus (amino acids 374-468). The putative active site is boxed with critically 
modified cysteine noted by an astmsk. Invariant residues are shaded in dark gray, partial 
identity in light gray, conservative substitutions in charcoal. Multiple sequence alignment 
was generated by ClustalW using only select bacterial sequences identified from a BLASTP 
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search of the protein database. Mammalian sulfatases are not included. Most sequences 
listed coxrespond to the open reading frame of genes to which only a putative sulfatase 
fiinction has been ascribed. GenBank accession numbers are as follows: AA605721 
(Pseuodmonas aeruginoasa.); AL355753 {Str^tomyces coelicolor ); BAB79937 {E, coli 
0157:H7); kKBllSlQ {Prevotella sp, MdsA gene); AAL:45441 (Agrobacterium 
tumefaciens); AAL19003 (Salmonella typhimurium ). 

Fig. 4 provides the results from the purification of recombinant 2-0 sulfatase from E. 
coli lysates by Ni"^^ chelation chromatography. Enzyme purity following each fractionation 
step was assessed by silver-staining of 12% SDS-polyacrylamide gels. Approximately 200 
ng of total protein was loaded in each well. Lane 1, bacterial lysate from uninduced (minus 
IPTG) control; lane 2, whole cell lysate; lane 3, 20,000 X g supernatant (column pre-load); 
lane 4, eluate from Ni"^^ chelation chromatography; lane 5, 2-0 sulfatase foUovraig thrombin 
cleavage to remove NH2 6X histidine purification tag. Molecular weight markers (Mr) and 
their corresponding masses are also shown. 

Fig. 5 illustrates the exclusive desulfation of the 2-OH position by the recombinant 
sulfatase. Panel (A) depicts the enzyme desulfating activity assayed by capillary 
electrophoresis using the 2-0 containing trisulfated heparin disaccharide AU2sHns,6s. Panel 
(B) depicts the activity using its disulfated coxmterpart to AU2sHns.6S lacking a sulfate at the 
2-OH position. Only in Panel (A) is a loss of sulfate observed. Minus enzyme control is 
shown as a dotted line. 

Fig. 6 provides the in vitro biochemical reaction conditions for the recombinant 2-0 
sulfatase. Panel (A) illustrates the eJBFect of pH. Sulfatase catalytic efficiency (kcat/Km) was 
measured as a function of varying pH from 5 to 8 using two overlapping buffers: 50 mM 
MES (solid circles) and 50 mM MOPS (open circles). Inset: Relative effect of three 
different assay buffers (each at pH 6.5) on optimal enzyme activity. 1. 50 mM MES; 2. 50 
mM imidazole; 3. 50 mM sodium phosphate. Panel (B) illustrates the effect of ionic 
strength. Shown here is % activity normaUzed to 50 mM NaCl. Panel (Q illustrates the 
effect of reaction temperature. Data is normahzed to 30^*0 activity (100%). The unsaturated 
disaccharide AU2sHns was used in all three experiments. 

Fig, 7 illustrates the substrate-product relationship between the 2-0 sulfatase and the 
A 4,5 glycuronidase. 2 mM of the unsaturated, 2-0 sulfated heparin disaccharide AU2sH>js 
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was preincubated with either 250 nM A 4,5 glycuronidase or 25 nM 2-0 AN^"^"^ for two 

o 

minutes at 30 C in a 100 ^lL reaction. Following this preincubation, the reciprocal enzyme 
was added to the reaction for up to six extra minutes. A 4,5 glycuronidase activity was 
measured in real time as the rate of substrate disappearance monitored by the loss of UV 
5 absorption at 232 nm. Zero time on the x-axis represents the time following the 
preincubation during which the second enzyme was added. 

Fig, 8 illustrates the results of the tandem use of 2-0 sulfatase and A 4,5 
glycuronidase in HSGAG compositional analyses. Panel (A) provides the results of 
exhaustively cleaving 200 ^g heparin with heparinase I, II and in. These heparinase- 

10 generated saccharides were then subjected to hydrolysis by the A 4,5 glycuronidase. Panel 
(B) provides the results of subsequent hydrolysis by 2-0 sulfatase after the heparinase 
treament. Panel (C) illustrates subsequent hydrolysis by 2-0 sulfatase and by A 4,5 
glycuronidase added simultaneously. Panel (D) depicts the 7 disaccharide peaks (and one 
tetrasaccharide peak) resolved by capillary electrophoresis (each numbered separately). Their 

15 compositional assigmnents are as follows: AU2sHns,6s (1); AUHnac,6sGHns3s,6s 
tetrasaccharide (2); AUssHns (3); AUHns,6s (4); AU2sHnac,6S (5); AUHns (6); AUzsHnac (7); 
andAUHNAc,6s(8). 

Kg. 9 illustrates the multiple sequence alignment of sulfatases using ClustalW. The 
sequence of F. heparinum 2-0 sulfatase (F20S) was aUgned with human arylsulfatase B 

20 (ARSB), human arylsulfatase A (ARSA) and P, aeruginosa arylsulfatase (PARS). The 
amino and carboxyl termini are not shown. The sequence numbers for each sulfatase are 
listed on the right. The numbers listed above the alignment correspond specifically to F20S 
sequence positions (see Figure 2 above). The critical active site cysteines are highhghted in 
black. Other highly conserved amino acids are highhghted in gray. 

25 Fig. 10 provides the structural model of 2-0 sulfatase and topology of the active site. 

Panel (A) is the ribbon diagram of the proposed 2-0 sulfatase stmcture constructed using 
homology modeling of the crystal structure of human arylsulfatase B. The p strands are 
shown as thicker areas of the ribbon diagram, and the a helices are shown as cylindricaUy 
shaped areas. The geminal diol form of the modified cysteine is also depicted (rendered as 

30 CPK; carbon and oxygen molecules are shown). The direction of substrate difiiising into the 
active site is indicated by an arrow. Panel (B) provides the CPK rendering of the top view of 
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the structure shown in Panel (A). The modified cysteine, the surrounding basic amino acids 
(Arg, His and Lys), acidic amino acids (Asp, Glu), and Ghi and Asn are all shown. Note that 
the active site geminal diol is located in the hottom of a deep cleft. 

Fig. 11 depicts the active site amino acids and their interaction with AUasHjss.es- 

5 Panel (A) is the stereo view of the 2-0 snlfatase active site highlighting important amino 
acids (shown here by a stick representation). Acidic amino acids (Asp), Ghi, Thr, Leu, and 
FGly 82 are depicted. The docked disaccharide is also shown using a stick representation. 
The sulfur atom of the 2-0 sulfate group (next to the lowest positioned oxygen) and oxygen 
atoms (circled) of the 2-0 sulfate group and the planar carboxyl group are also depicted. 

10 Panel (B) provides the schematic representation of the amino acids shown in Panel (A). 
Potential metal ion coordination is also shown with the divalent cation (Mg^"^ depicted as a 
gray circle. 

Fig. 12 illustrates the exolytic activity of the 2-0 sulfatase by analyzing the ability of 
the sulfatase to hydrolyze internally positioned 2-0 sulfates within the ATIO decasaccharide 

15 and subsequent compositional analyses of the heparinase-treated product. Panel (A) shows 
the AT-10 decasaccharide sequence with PEN-MALDI nomenclature and outline of 
experimental design. Panel (B) provides the capillary electrophoretogram for both the 
control and sulfatase pre-treated samples along with their saccharide compositional 
assignments. Heparinase cleavage products foUownig siilfatase pre-treatment are shown as a 

20 dashed line (with gray fill). Minus sulfatase control is shown as a white line (no fill). The 
pentasulfated tetrasaccharide (4, -7) is also noted. Disappearance of the trisulfated 
disaccharide (D) by one-third and the corresponding appearance of the 2-0 desulfated 
product (AIJHns,6s) are depicted by vertical arrows. The minor tetrasaccharide contaminant 
is noted by an asterisk. 

25 Fig. 13 illustrates the steady-state kinetics for various unsaturated disaccharide 

substrates. Panel (A) provides the initial rates detennined using 25 nM enzyme under 
standard conditions. Substrate saturation data were fit to pseudo-first order Michaehs- 
Menten assumptions using a non-linear least squares analysis. AUasHNac (A); AU2sHNac6s (•); 
AU2sHns (A.); AU2sH>,s.6s (0); AU2sGalNAc,6s (+)■ 
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Fig. 14 provides the comparable CD spectroscopy of the wild-type 2-0 AN^'^"* 
sulfatase and C82A site-directed mutant— wild-type enzyme (•), C82A mutaat (O). Band 
intensities are expressed as molar ellipticities with units indicated. 

Fig, 15 illustrates the identification of 2-0 sulfatase active site modification (FGly) by 
5 chemical labeling and mass spectrometry. Wild-type sulfatase (2-0 AN^"^"^) and C82A 
mutant were reacted with Texas Red Hydrazide and subjected to trypsin proteolysis as 
described in Materiab and Methods. The molecular masses of the resultant peptides were 
subsequently characterized by MALDI-MS. Panel (A) shows the unlabeled wild-type 
sulfatase control. Panel (B) shows the covalently labeled wild-type sulfatase. Panel (C) 
10 shows the C82A mutant refractory to chemical labeling. A unique molecular mass signature 
in Panel (B) is noted by an asterisk. 

Fig. 16 shows a multiple sequence alignment of the sulfatases using ClustalW. The 
putative active site is boxed, with critically modified cysteine noted by an asterisk. Invariant 
residues are shaded in dark gray, those with partial identity in light gray, and conservative 
15 substitutions in charcoal. Multiple sequence alignment was generated by ClustalW using 
only select sequences identified from a BLAST? search of the protein data base. MammaUan 
sulfatases are included. Enzymes are abbreviated as follows. FH2S, F. heparinum 2-0- 
sulfatase; PARS, P, aeruginosa arylsulfatase; MDSA, Prevotella sp, MdsA gene; HGal6S, 
human N-acetylgalactosamine- 6-sulfate sulfatase (chondroitin 6-sulfatase); HARSA, human 
20 cerebroside-3-sulfate sulfatase (arylsulfatase A); HARSB, human N-acetylgalactosamine-4 
sulfate sulfatase (arylsulfatase B); HIZS, human iduronate-2-sulfate sulfatase; cons, 
consensus sequence. The GenBankTM protein accession numbers for sulfatases listed are as 
follows: CAA88421, P. aeruginosa arylsulfatase^ kJ^FllSlQ, Prevotella sp, MdsA mucin 
desulfating gene; AAC51350, Homo sapiens N-acetylgalactosamine-6-sulfate sulfatase; 
25 AAB03341, H. sapiens cerebroside-3-sulfate sulfatase (arylsulfatase A); AAA51784, H. 
sapiens N-acetylgalactosamine-4-sulfate sulfatase (arylsulfatase B); AAA63197, H. sapiens 
iduronate-2-sulfate sulfatase. 



DETAILED DESCRIPTION OF THE INVENTION 

Heparin and heparin sulfate glycosaminoglycans (HSGAGs) are structurally complex 
linear polysaccharides (Esko, J. D., and Lindahl, U. (2001) J Clin Invest 108(2), 169-73, 
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Lindahl, U., Kusche-Gullberg, M, and Kjelleii, L. (1998) J Biol Chem 273(39), 24979-82) 
comprised of repeating disaccharides of uronic acid (c6-L-iduronic or /3-D-glucuronic) linked 
l-> 4 to ct-D-glucosamine. The extensive chemical heterogeneity of these biopolymers derives 
from both the variable immber of their constituent disaccharides as well as the combinatorial 
potential for chemical modification at specific positions within each of these building blocks. 
Such modifications include acetylation or sulfation at the N-position of the glucosamine, 
epimerization of glucuromc acid to iduronic acid and additional sulfations at the 2-0 position 
of the uronic acid in addition to the 3-0, 6-0 position of the adjoining glucosamine. It is a 
highly variable sulfation pattern, in particular, that ascribes to each GAG chain a unique 
structural signature. In turn, this signature dictates specific GAG-protein interactions 
underlying critical biological processes related to cell and tissue function. 

One of the more fomiidable challenges currently facing the glycobiology field is the 
design of effective analytical methods to study this structure-function relationship at the 
molecular level. Given this critical stmcture-fimction relationship of GAG sulfation, enzymes 
which can hydrolyze these sulfates in a structurally-specific manner become important in 
several ways. To begin with, the systematic desulfation of GAGs at discrete positions is 
central to GAG catabolism that occurs in divergent organisms ranging fi:om bacteria to 
mammals. In addition, the in vivo desulfation of intact GAG chains both at discrete chemical 
positions and in a cell specific, temporally relevant context is also likely to serve as an 
important molecular switch for abrogating targeted GAG-protein interactions. 

2-0 sulfatase is a desulfating enzyme that can be now added to the repertoire of 
enzymes used to analyze GAGs and degrade them in a specific manner. As used herein, the 
term "degraded glycosaminoglycan" or "GAG jfragmenf ' is intended to encompass a 
glycosaminoglycan that has been altered from its original form by the activity of a 2-0 
sulfatase or other enzyme that can act thereon. The degraded glycosaminoglycan includes 
glycosaminoglycans that have been altered by the activity of a 2-0 sulfatase in some 
combination with other glycosaminoglycan degrading enzymes as described herein. The 
degraded glycosamuioglycan may be desulfated, cleaved or desulfated and cleaved. Any of the 
degraded products produced by the activity of the 2-0 sulfatase and/or other enzymes on the 
glycosaminoglycan are intended to be used in the compositions, pharmaceutical preparations 
and methods of the invention. In addition, this sulfatase can be used in treatment methods 
along with the GAG fragments they degrade. 2-0 sulfatase is a member of a large enzyme 
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family that hydrolyze a wide array of sulfate esters (for a review, see (Parenti, G., Meroni, G., 
and Ballabio, A. (1997) Curr Opin Genet Dev 7(3), 386-91, von Figura, K., Schmidt, B., 
Selmer, T., and Dierks, T. (1998) Bioessays 20(6), 505-10)). This enzyme exhibits 2^0 
specific sulfatase activity as measured using the trisidfated, unsaturated heparin disaccharide 
5 AU2sHns,6S as a substrate (described below). The activity of the enzyme is not limited to 2-0 
desulfation alone, however, as 2-0 sul&tase was found to hydrolyze at the 6-0 and 2N 
positions of glucosamine. 2-0 sulfatase can be used to hydrolyze heparin and chondroitin 
disaccharides and can also desulfate GAGs with longer chain lengths such as tetra- and 
decasaccharides. Furthennore, 2-0 sulfatase has been found to work with other GAG 

10 degrading enzymes such as heparinases and A 4,5 glycuronidase and can be used in 
conjunction with these other enzymes as described herein. 

Like the A 4,5 glycuronidase, which we have recently cloned and expressed (Myette, 
J. R., Shriver, Z., Kiziltepe, T., McLean, M. W., Venkataraman, G., and Sasisekharan, R. 
(2002) Biochemistry 41(23), 7424-7434), we have successfully cloned from Flavobacterium 

15 hepannum and expressed the 2-0 sulfatase in E. coli, from which milligram quantities of 
highly active, soluble enzyme were readily purified. As was also the case for the 
glycuronidase, we found that the yield of soluble recombinant enzyme was greatly improved 
by the engineered removal of the hydrophobic N-terminal signal sequence comprised of the 
first 24 amino acids. This signal sequence was predicted by the von Heinje method which 

20 also identified the likely signal peptidase cleavage recognition sequence AXAXA By 

engineering a 2-0 sulfatase N-terminal truncation lacking this sequence (herein referred to as 
2-0 AN^'^\ we achieved protein yields exceeding 100 mg of relatively pure sulfatase per 
liter of induced bacterial cultures using a single chromatographic step. 

The invention, therefore, provides, in part, a recombinantly produced 2-0 sulfatase. 

25 As used herein, a '"recombinant 2-0 sulfatase" is a 2-0 sulfatase that has been produced 
through human manipulation of the nucleic acid that encodes the enzyme. The human 
manipulation usually involves joining the nucleic acid that encodes tiie 2-0 sulfatase to the 
genetic material of a different organism and, generally, a different species. "Recombinant** is 
a term of art that is readily known to one of skill, and techniques for the recombinant 

30 expression of 2-0 sulfatase are readily available to those of skill in the art and include those 
described in Sambrook et al.. Molecular Cloning-A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., (1989) or Currrait Protocols in Molecular Biology 
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Volumes 1-3, John Wiley & Sons, Inc. (1994-1998). Other techniques for recombinant 
expression including examples of expression systems are described further below. 

As provided herein, recombinant technology can be used to produce a 2-0 sulfatase 
encoded by the nucleic acid sequence of SEQ ID NO: 1 or having the amino acid sequence of 

5 SEQ ID NO: 2. In other aspects of the invention a 2-0 sulfatase encoded by the nucleic acid 
sequence of SEQ ID NO: 3 or having the amino acid sequence of SEQ ID NO: 4 can be 
prepared. The 2-0 sulfatase as provided herein is, in general, produced through the 
manipulation of isolated nucleic acids. 

The invention also provides the isolated nucleic acid molecules that code for a 2-0 

10 sulfatase as described herein. The term "isolated nucleic acid", as used herein, means: (i) 
amphfied in vitro by, for example, polymerase chain reaction (PGR); (ii) recombinantly 
produced by cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, 
for example, chemical synthesis. An isolated nucleic acid is one which is readily 
manipulable by recombinant DNA techniques well known in the art. Thus, a nucleotide 

15 sequence contained in a vector La which 5' and 3' restriction sites are known or for which 
polymerase chain reaction (PGR) primer sequences have been disclosed is considered 
isolated but a nucleic acid sequence existing in its native state in its natural host is not. An 
isolated nucleic acid may be substantially purified, but need not be. For example, a nucleic 
acid that is isolated within a cloning or expression vector is not pure in that it may comprise 

20 only a tiny percentage of the material in the cell in which it resides. Such a nucleic acid is 
isolated, however, as the term is used herein because it is readily manipulable by standard 
techniques known to those of ordinary skill in the art. 

According to the invention, isolated nucleic acid molecules that code for a 2-0 
sulfatase include: (a) nucleic acid molecules which hybridize xmder stringent conditions to a 

25 molecule selected firom a group consisting of the nucleotide sequences set forth as SEQ ID 
NO: 1 and 3 and which code for a 2-0 sulfatase or parts thereof, (b) deletions, additions and 
substitutions of (a) which code for a 2-0 sulfatase or parts thereof, (c) nucleic acid molecules 
that differ firom the nucleic acid molecules of (a) or (b) in codon sequence due to the 
degeneracy of the genetic code, and (d) complements of (a), (b) or (c). The isolated nucleic 

30 acid molecules include isolated nucleic acid molecules that code for a 2-0 sulfatase which 
has an amino acid sequence set forth as SEQ ID NOs: 2 and 4. 
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The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
5 art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating 2-0 
sulfatase. Similarly, nucleotide sequence triplets which encode other amino acid residues 
include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, CGC, 
CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and ACT (threonine 

10 codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT (isoleucine codons). 
Other amino acid residues may be encoded similarly by multiple nucleotide sequences. Thus, 
the invention embraces degenerate nucleic acids that differ from the biologically isolated 
nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The isolated nucleic acid molecules of the invention are also intended to encompass 

15 homologs and alleles which can be identified by conventional techniques. Identification of 
human and other organism homologs of 2-0 sulfatase polypeptides will be famihar to those 
of skill in the art. In general, nucleic acid hybridization is a suitable method for identification 
of homologous sequences of another species (e.g., human, cow, sheep), which correspond to 
a known sequence. Standard nucleic acid hybridization procedures can be used to identify 

20 related nucleic acid sequences of selected percent identity. For example, one can construct a 
Ubrary of cDNAs reverse transcribed from the mRNA of a selected tissue and use the nucleic 
acids that encode a 2-0 sulfatase identified herein to screen the Hbrary for related nucleotide 
sequences. The screening preferably is performed using high-stringency conditions to 
identify those sequences that are closely related by sequence identity. Nucleic acids so 

25 identified can be translated into polypeptides and the polypeptides can be tested for activity. 

The term "stringent conditions" as used herein refers to parameters with which the art 
is familiar. Such parameters include salt, temperature, length of the probe, etc. The amount 
of resulting base mismatch upon hybridization can range fit>m near 0% ("high stringency") to 
about 30% ("low stringency"). Nucleic acid hybridization parameters may be found in 

30 references that compile such methods, e.g. Molecular Cloning: A Laboratory Manual, J. 
Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, et al.. 
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eds., John Wiley & Sons, Inc., New York.. One example of high-stringency conditions is 
hybridization at 65°C in hybridization buffer (3.5X SSC, 0.02% Ficoll, 0.02% polyvinyl 
pyrrolidone, 0.02% Bovine Serum Albumin, 2.5mM NaEi2P04(pH7), 0.5% SDS, 2mM 
EDTA). SSC is 0.15M sodium chloride/0.1 5M sodium citrate, pH7; SDS is sodium dodecyl 
5 sulphate; and EDTA is ethylenediaminetetracetic acid. After hybridization, a membrane 
upon which the nucleic acid is transferred is washed, for example, in 2X SSC at room 
temperature and then at 0.1 - 0.5X SSC/O.IX SDS at temperatures up to 68T. 

The skilled artisan also is famihar with the methodology for screening cells for 
expression of such molecules, which then are routinely isolated, followed by isolation of the 
10 pertinent nucleic acid. Thus, homologs and alleles of the 2-0 sulfatase of the invention, as 
well as nucleic acids encoding the same, may be obtained routinely, and the invention is not 
intended to be limited to the specific sequences disclosed. It will be understood that the 
skilled artisan will be able to manipulate the conditions in a manner to permit the clear 
identification of homologs and alleles of the. 2-0 sulfatase nucleic acids of the invention. The 
15 skilled artisan also is familiar with the methodology for screening ceUs and hbraries for 
expression of such molecules which then are routinely isolated, followed by isolation of the 
pertinent nucleic acid molecule and sequencing. 

In general, homologs and alleles typically will share at least 90% nucleotide identity 
and/or at least 95% amino acid identity to the sequences of 2-0 sulfatase nucleic acids and 
20 polypeptides, respectively, in some instances v/ill share at least 95% nucleotide identity 
and/or at least 97% amino acid identity, in other instances will share at least 97% nucleotide 
identity and/or at least 98% amino acid identity, in other instances will share at least 99% 
nucleotide identity and/or at least 99% amino acid identity, and in other instances will share 
at least 99.5% nucleotide identity and^or at least 99.5% amino acid identity. The homology 
25 can be calculated using various, pubUcly available software tools developed by NCBI 
(Bethesda, Maryland) that can be obtained through the internet. Exemplary tools include the 
BLAST system available from the website of the National Center for Biotechnology 
Information (NCBI) at the National Institutes of Health. Pairwise and ClustalW aUgnments 
(BLOSUM30 matrix setting) as well as Kyte-Doohttle hydropathic analysis can be obtained 
30 using the MacVector sequence analysis software (Oxford Molecular Group). Watson-Crick 
complements of the foregoing nucleic acids also are embraced by the invention. 
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In screening for 2-0 sulfatase related genes, such as homologs and alleles of 2-0 
sulfatase, a Southern blot may be performed using the foregoing conditions, together with 
radioactive probe. After washing the membrane to which the DNA is JSnally transferred, the 
membrane can be placed against X-ray fihn or a phosphoimager plate to detect the 
5 radioactive signal. 

The recombinantly produced 2-0 sulfatase as provided herein exhibited robust, 2-0 
specific sulfatase activity. The success with expressing a highly active 2-0 sulfatase clearly 
vahdates our use of jE*. coli as a recombinant expression system for the large-scale production 
of active enzyme. Therefore, active isolated 2-0 sulfatase polypeptides (including whole 
10 proteins and partial proteins) are provided herein which include isolated 2-0 sulfatase 
polypeptides that have the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4. 

Polypeptides can be isolated from biological samples, and can also be expressed 
recombinantly in a variety of prokaryotic and eukaryotic expression systems, such as those 
described above, by constructing an expression vector appropriate to the expression system, 
15 introducing the expression vector into the expression system, and isolating the recombinantly 
expressed protein. Polypeptides can also be synthesized chemically using well-estabUshed 
methods of peptide synthesis. 

As used herein, "isolated polypeptide" means the polypeptide is separated from its 
native environment and present in suflBcient quantity to permit its identification or use. This 
20 means, for example: (i) selectively produced by expression cloning or (ii) purified as by 
chromatography or electrophoresis. Isolated proteins or polypeptides may be, but need not 
be, substantially pure. Because an isolated polypeptide may be admixed with a 
pharmaceuticaUy acceptable carrier in a pharmaceutical preparation, the polypeptide may 
comprise only a small percentage by weight of the preparation. The polypeptide is 
25 nonetheless isolated in that it has been separated from the substances with which it may be 
associated in living systems, i.e., isolated from other proteins. 

As used herein, the term "substantially pure" means that the proteins are essentially 
free of other substances to an extent practical and appropriate for their intended use. In 
particular, the proteins are sufficientiy pure and are suflficientiy free from other biological 
30 constituents of their hosts cells so as to be useful in, for example, protein sequencing, or 

producing pharmaceutical prq)arations. As used herein, a "substantially pure 2-0 sulfatase" 
is a preparation of 2-0 sulfatase which has been isolated or synthesized and which is greater 
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than about 90% free of contaminants. Preferably, the material is greater than 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, or even greater than 99%fi:ee of contaminants. The 
degree of purity may be assessed by means known in the art. One method for assessing the 
purity of the material may be accomplished through the use of specific activity assays. 
5 The cloned, fiill-length gene of tiie 2-0 sulfatase encodes an open reading frame 

(ORF) of 468 amino acids (Fig. 2), with a predicted molecular mass of 5 1 .9 kDa. This 
theoretical molecular weight is approximately 10 kDa less than the value reported in the 
literature (McLean, M. W., Bmce, J. S., Long, W. F., and Williamson, F. B. (1984) Eur J 
Biochem 145(3), 607-15). Based on its amino acid composition, the encoded protein is quite 

10 basic (theoretical pi of 8.75). A further analysis of its primary amino acid sequence 

unequivocally places this ORF as a member of a larger sulfatase family. As members of a 
large enzyme family, the sulfatases hydiolyze a wide array of sulfate esters (for a review, see 
(Parenti, G., Meroni, G., and Ballabio, A. (1997) Curr Opin Genet Dev 7(3), 386-91, von 
Figura, K., Schmidt, B., Sehner, T., and Dierks, T. (1998) Bioessays 20(5), 505-10)). Their 

15 respective substrates include sulfated complex carbohydrates such as the glycosaminoglycans 
(GAGs), steroids, sphingolipids, xenobiotic compomids, and amino acids such as tyrosine. 
Additionally, many of these enzymes are able to hydrolyze in vitro smaller synthetic 
substrates (e.g., 4-nitrophenyl sulfate and catechol sulfate). It is for this reason that these 
enzymes are often generically described as "arylsulfatases" (even when their preferred in vivo 

20 substrate is ill-defined). Despite their disparate substrate specificities, the members of tins 
enzyme family share both considerable stmctural homology and a common catalytic 
mechanism with one another (Waldow, A., Schmidt, B., Dierks, T., von Bulow, R., and von 
Figura, K. (1999) J Biol Chem 274(18), 12284-8). 

The flavobacterial 2-0 sidfatase possesses considerable sequence homology to other 

25 bacterial (and non-bacterial) sulfatases, especially within its amino terminus in which resides a 
highly conserved sulfatase domain. This signature catalytic domain is readily identified by the 
consensus sequence C/SXPXRXXXXS/TG (SEQ. ID NO: 6). The conserved cysteine (or less 
commonly serine) within this sulfatase motif is of particular functional importance as it is 
covalently modified to a L-Ca- formylglycine (L-2-amino-3"Oxo-propionic acid). The 

30 ubiquitous importance of this chemical modification was first fimctionally identified by its 
relationship to the etiology of multiple sulfatase deficiency (MSD), a genetically recessive 
disorder in which there is a complete loss of sulfatase activity due to a lack of this critical 
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aldehyde (FGly) within the active site of all expressed sidfatases (Kolodny, E. H. a. F., A. L. 
(1995) in The Metabolic and Molecular Bases of Inherited Disease (Scriver, C. R., Beaudet, 
A. L., Sly, W. S., and Valle, D., ed), pp. 2693-2741, McGraw-ffill, New York). We have 
identified the conserved sulfatase active site by sequence homology which we have found 

5 includes a cysteine and not a serine as the critical amino acid predicted to be chemically 

modified as a formylglycine in vivo. An empirical demonstration of this active-site aldehyde at 
this position is presented in Examples. 

While the cloned flavobacterial sulfatase exhibits the highest sequence similarity to the 
bacterial arylsulfatases (especially the arylsulfatase from Pseudomonas aeruginosa)^ we point 

10 out that a limited homology of the 2-0 sulfatase does extend to the manmaalian 

glycosamiaoglycan sulfatases functioning in the lysosomal degradation pathway. As is the 
case for the bacterial enzymes, this sequence homology is strongest in the NHa-terminus where 
the putative sulfatase domain resides. Among the human lysosomal enzymes, it is the 
galactosamine (N-acetyl)-6-sulfate sulfatase (chondroitin 6-0 sulfatase) which exhibits the 

15 closest similarity with the flavobacterial 2-0 sulfatase; the two enzymes possess approximately 
26% identity when comparing their entire protein sequences. There are also two fimctionally 
related lysosomal sulfatases which specifically hydrolyze the 2-OH position of ^lronic acid. 
These enzymes are the iduronate 2-sulfate sulfatase (EDS) (Biehcki, J., Freeman, C, Clements, 
P. R., and Hopwood, J. J. (1990) Biochem 7271(1), 75-86) and the glucuronic-2-sulfate 

20 sulfatase (Freeman, C, and Hopwood, J. J. (1989) Biochem J 259(1% 209-16). The IDS and 
flavobacterial 2-0 sulfatase exhibit only a limited sequence homology (less than 22% identity), 
however. 

Both of these enzymes desulfate heparan sulfate, the iduronate-2-sulfate sulfatase (IDS) 
also acts on demiatan sulfate. Both enzymes possess an acidic pH optima for activity, a fact 

25 consistent with their location within the lysosome. The two sulfatases initially exist as 

precursors which must be proteolytically processed for activity. The native molecular weight 
of the human IDS precursor has been reported in the range of 42 to 65 kDa (Bielicki, J., 
Freeman, C, Clements, P. FL, and Hopwood, J. J. (1990) Biochem 7271(1), 75-86), while its 
theoretical mass based entirely on its amino acid composition is approximately 62 kDa. As 

30 such, the mammalian lysosomal IDS is somewhat larg^ than its flavobacterial counterpart, 
while also requiring substantial posttranslational modification for maximal enzyme activity. 
The acidic pH optima for the lysosomal enzymes would also appear to limit their in vitro use 
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for the determination of HSGAG composition, at least when used ixi tandem with other 
flavobacterial HSGAG degrading enzymes such as the heparinases or the A 4,5 glycuronidase; 
these latter enzymes all possess a pH optima much closer to neutrality. 

A homology-based structural model of the 2-0 sulfatase active site was constructed 

5 using as a framework the available crystallographic data for three highly related arylsulfatases. 
In this model, we have identified im^portant structural parameters within the enzyme active site 
relevant to enzyme function, especially as relates to its substrate specificity (substrate binding 
and catalysis). By docking various disaccharide substrates, we were also able to make specific 
predictions concerning structural determinants present within these potential substrates that 

10 would complement this unique active site architecture. These determinants included the 

position and number of sulfates present on the glucosamine, oligosaccharide chain length, the 
presence of a A 4,5 unsaturated double bond, and the exolytic vs. endolytic potential of the 
enzyme. These predictions were then tested against biochemical and kinetic data which largely 
validated our substrate specificity predictions. Our modeling approach was further 

15 complemented experimentally using aldehyde-specific chemical labeling, peptide mapping in 
tandem with mass spectrometry and site-directed mutagenesis to physically demonstrate the 
presence of a covalently modified cysteine (formyl glycine (FGly)) within the active site. This 
combinatorial approach of structure modeling and biochemical studies has provided insight 
into the molecular basis of enzyme function. 

20 The crystal stmctures of two human lysosomal sulfatases, cerebroside-3-sulfate 3- 

sulfohydrolase (arylsulfatase A), (Lukatela, G., Krauss, N., Theis, K., Sehner, T., Gieselmann, 
v., von Figura, K., and Saenger, W. (1998) Biochemistry 37(11), 3654-64, von Bulow, R., 
Schmidt, B., Dierks, T., von Figura, K., and Uson, L (2001) JMol Biol 305(2), 269-77) 
N-acetylgalactosamme-4-sulfate 4-siilfohydrolase (arylsulfatase B) (Bond, C. S., Clements, P. 

25 R., Ashby, S. J., CoUyer, C. A., Hairop, S. J., Hopwood, J. J., and Guss, J. M. (1997) Structure 
5(2), 277-89), and abacterial arylsulfatase from Pseudomonas aemginosa (Boltes, L, 
Czapinska, H., Kahnert, A., von Bulow, R, Dierks, T., Schmidt, B., von Figura, K., Kertesz, 
M. A., and Uson, I. (2001) Structure (Camb) 9(6), 483-91) have teen solved. These three 
sulfatases share an identical alkaline-phosphatase like structural fold (according to Structural 

30 Classification of Proteins database (www.pdb.org)) comprised of a series of mixed parallel and 
antiparallel |8 strands flanked by long and short ahehces on either side (Lukatela, G., Krauss, 
N., Theis, K., Sehner, T., Giesebnann, V., von Figura, K., and Saenger, W. (1998) 
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Biochemistry 37(11), 3654-64, Bond, C. S., Clements, P. R., Ashby, S. J., CoUyer, C. A., 
Hairop, S. J., Hopwood, J. J., and Guss, J. M. (1997) Structure 5(2), 277-89, Boltes, L, 
Czapinska, H., Kahnert, A., von Bulow, R., Dierks, T., Schmidt, B., von Figura, K., Kertesz, 
M. A., and Uson, 1. (2001) Structure (Camb) 9(6), 483-91, von Bulow, R., Schmidt, B., Dierks, 
5 T., von Figura, K., and Uson, L (2001) JMol Biol 305(2), 269-77). In addition to their 
common structural fold, these sulfatase structures also possess a high degree of homology 
within their respective active sites, especially in the region localized around the modified 
cysteine (FGly). Taken together, these crystal structures present a clear and consistent 
description of conserved active site residues at least as it relates to a hkewise conserved 

10 mechanism of sulfate ester hydrolysis. At the same time, this strong structural homology is 
somewhat surprising considering that at least two of these sulfatases act on notably different 
substrates, e.g., sulfated sphingoUpid vs. sulfated glycosaminoglycan (GAG). 

It was discovered that 2-0 sulfatase has a relatively high cysteine content. Apart from 
the catalytic cysteine at position 82, none of the remaining seven cysteines appeared to be 

15 highly conserved among other members of the sulfatase family. Enzyme activity was not 
inhibited with the addition of DTNB (Ellman's reagent) or DTT. This general lack of 
inhibition by these two cysteine-reactive agents suggests at least two probabilities. First, the 2- 
O sulfatase does not require intramolecular disulfide hnkages to critically stabiUze a 
catalytically active conformation. Second, free sulfhydryls are not direcfly participating in 

20 catalysis. It is possible, however, that a few of these cysteines are buried and therefore not 
accessible to sulfhydryl exchange. At least five of the eight cysteines, however, do react with 
DTNB under nondenaturing conditions. This latter fact suggests an alternate role for these 
solvent-accessible cysteines (along with specific histidines) ie., metal-coordinating thiolates. 
Comparison between the 2-0 sulfatase and alkaUne phosphatase reveals that these enzymes are 

25 esterases with similar catalytic mechanisms, including the presumptive formation of a co valent 
intermediate. The two hydrolytic enzymes also possess structurally related domains, m 
particular, a highly superimposible active site that includes a divalent metal binding pocket. In 
the case of alkaline phosphatase, it is zinc rather than calcium (or Mg"^^) that is coordinated 
within this pocket. 

30 The 2-0 sulfatase possesses 67 basic amino acids, including the catalytic histidine at 

position 136, a proximal lysine at position 134 and an invariant arginine at position 86 found 
within the defining sulfatase consensus sequence. Moreover, crystal structures of the active 



. PGT/US2Q04/000332 



-24- 

site of related sulfatases.each clearly show at least four basic residues participating in catalysis 
which was also found in oxir homology model. A masking of these important charges by 
exogenous ions would interfere with their catalytic function. 

Of the 8 histidines present in the flavobacterial 2-0 sulfatase, H136 is invariantly 
conserved among the structurally related bacterial sulfatases examined. For each of these 
enzymes, this highly conserved histidine is found within a putative consensus sequence 
GKWHX (SEQ. ED NO: 7) (where X is a hydrophobic amino acid). Other conserved histidines 
include His 296 and His 303. Catalytically important histidines have been observed withm the 
active site of several sulfatase crystal structures including human lysosomal N- 
acetylgalactosarmne-4 sulfatase (arylsulfatase B) (Bond, C. S., Clements, P. R., Ashby, S. J., 
CoUyer, C. A., Hatrop, S. J., Hopwood, J. J., and Guss, J. M. (1997) Structure 5(2), 277-89) 
and arylsulfatase A (Lukatela, G., Krauss, N., Theis, K., Sekier, T., Giesehnann, V., von 
Figura, K., and Saenger, W. (1998) Biochemistry 37(11), 3654-64) as well as the aiysulfatase 
&om Fseudomonas aeriginosa (Boltes, L, Czapinska, H., Kahnert, A., von Bulow, R., Dierks, 
T., Schmidt, B., von Figura, K., Kertesz, M. A., and Uson, L (2001) Structure (Camb) 9(6), 
483-91) to which the flavobacterial 2-0 sulfatase appears to be most closely related. In the 
latter case, His 21 1 appears to hydrogen bond with the sulfate oxygen (04) contributing 
perhaps to proper sulfate coordination. Additionally, theNSl of His 115 of P. aeruginosa (His 
242 in the human 4-S sulfatase) is within hydrogen bonding distance to the 0>2 of the catalytic 
formylglycine. The presence of His 136 in the active site and its participation in catalysis is 
strongly supported by our homology studies. 

The flavobacterial 2-0 sulfatase possesses 52 acidic amino acids, several of which are 
highly conserved (e.g.. Asp 42, Asp 269, Asp 286, Asp 295, and Asp 342). Interestingly, four 
acidic side chains are also found in a consensus active site also observed in known crystal 
structures. In this snapshot, these four carboxylates appear to coordinate a divalent metal ion 
(typically calcium). This divalent metal in turn coordinates with the formylglyciae hydroxylate 
and possibly the O7I group of the sulfate. 

Based on the understanding of the important residues involved in the ftmction of 2-0 
sulfatase, the invention also embraces functional variants. As used herein, a "functional 
variant" of a 2-0 sulfatase polypeptide is a polypeptide which contains one or more 
modifications to the primary amino acid sequence of a 2-0 sulfatase polypeptide. The 
polypeptide can contain 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 ,13 ,14, 15, 16, 17, 18, 19, 20, 25, 
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30, 35, 40, 50 or more amino acid modijScations. These modifications are intended to 
encompass modifications that result in a 2-0 sulfatase with altered activity relative to the 
native 2-0 sulfatase but also include modifications that do not result in altered activity 
relative to the native enzyme. The term ^'native" as used herein refers to the 2-0 sulfatase as 

5 it would be found in nature. Modifications which create a 2-0 sulfatase polypeptide 
fimctional variant are typically made to the nucleic acid which encodes the 2-0 sulfatase 
polypeptide, and can include deletions, point mutations, truncations, amino acid substitutions 
and addition of amino acids or non-amino acid moieties to: 1) enhance a property of a 2-0 
sulfatase polypeptide, such as protein stabihty in an expression system or the stabihty of 

10 protein-protein binding; 2) provide a novel activity or property to a 2-0 sxilfatase polypeptide, 
such as addition of a detectable moiety; or 3) to provide equivalent or better interaction with 
other molecules (e.g., heparin). Alternatively, modifications can be made directly to the 
polypeptide, such as by cleavage, addition of a linker molecule, addition of a detectable 
moiety, such as biotin, addition of a fatty acid, and the like. Modifications also embrace 

15 fiision proteins comprising all or part of the 2-0 sulfatase amino acid sequence. One of skill 
in the art will be familiar with methods for predicting the effect on protein conformation of a 
change in protein sequence, and can thus "design" a fimctional variant 2-0 sulfatase 
polypeptide according to known methods. One example of such a method is described by 
Dahiyat and Mayo in Science 278:82-87, 1997, whereby proteins can be designed de novo, 

20 The method can be ^phed to a known protein to vary only a portion of the polypeptide 

sequence. By applying the computational methods of Dahiyat and Mayo, specific variants of 
a polypeptide can be proposed and tested to determine whether the variant retains a desired 
conformation. 

Functional variants can include 2-0 sulfatase polypeptides which are modified 
25 specifically to alter a feature of the polypeptide unrelated to its physiological activity. For 
example, cysteine residues can be substituted or deleted to prevent unwanted disulfide 
linkages. Similarly, certain amino acids can be changed to enhance expression of a 2-0 
sulfatase polypeptide by eliminating proteolysis by proteases in an expression system (e.g., 
dibasic amino acid residues in yeast expression systems in which KEX2 protease activity is 
30 present). Functional variants, therefore, can also include variant 2-0 sulfatase that maintain 
the same enzymatic fimction as the native 2-0 sulfatase but include some modification to the 
amino acid sequence that does not alter native enzyme activity. These modifications include 
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conservative amino acid substitutions as well as non-conservative amino acid substitutions 
thiat are remote &om the binding and catalytic sites of the enzyme. 

Mutations of a nucleic acid which encodes a 2-0 sulfatase polypeptide preferably 
preserve the amino acid reading frame of the coding sequence, and preferably do not create 
5 regions in the nucleic acid which are likely to hybridize to form secondary structures, such as 
hairpins or loops, which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to detemiine which 
10 mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant 2-0 sulfatase polypeptides) which are silent as to the 
amino acid sequence of the polypeptide, but which provide preferred codons for translation in 
a particular host. The preferred codons for translation of a nucleic acid in, e.g., E. coli^ are 
well known to those of ordinary skill in the art. Still other mutations can be made to the 
15 noncoding sequences of a 2-0 sulfatase gene or cDNA clone to enhance expression of the 
polypeptide. 

In the description that follows, reference vtiU be made to the amino acid residues and 
residue positions of native 2-0 sulfatase disclosed in SEQ ID NO: 1 . In particular, residues 
and residue positions will be referred to as "corresponding to" a particular residue or residue 

20 position of 2-0 sulfatase. As will be obvious to one of ordinary skill in the art, these 

positions are relative and, therefore, insertions or deletions of one or more residues would 
bave the effect of altering the numbering of downstream residues. In particular, N-terminal 
insertions or deletions would alter the numbering of all subsequent residues. Therefore, as 
used herein, a residue in a recombinant modified heparinase will be referred to as 

25 ''corresponding to" a residue of the full 2-0 sulfatase if, using standard sequence comparison 
programs, they would be aligned. Many such sequence ahgnment programs are now 
available to one of ordinary skill in the art and their use in sequence comparisons has become 
standard (e.g., "LALIGN" available via tiie Internet at http://phaedra.crbm.cnrs- 
niop.fr/fasta/lalign-query.html). As used herein, this convention of referring to tiie positions 

30 of residues of the recombinant modified heparinases by their corresponding 2-0 sulfatase 
residues shall extend not only to embodiments including N-terminal insertions or deletions 
but also to internal insertions or deletions (e.g, insertions or deletions in "loop" regions). 



- - wo 2004/062592- - ^/'is^m..- ^T/ifs: . „PCTAJS2004/000332- 

-27- 

One type of amino acid substitution is referred to as a "conservative substitution." As 
used herein, a "conservative amino acid substitution" or "conservative substitution" refers to 
an amino acid substitution in which the substituted amino acid residue is of similar charge as 
the replaced residue and is of similar or smaller size than the replaced residue. Conservative 
5 substitutions of amino acids include substitutions made amongst amino acids within the 
following groups: (a) the small non-polar amino acids. A, M, I, L, and V; (b) the small polar 
amino acids, G, S, T and C; (c) the amido amino acids, Q and N; (d) the aromatic annno 
acids, F, Y and W; (e) the basic amino acids, K, R and H; and (f) the acidic amino acids, E 
and D. Substitutions which are charge neutral and which replace a residue with a smaller 
10 residue may also be considered "conservative substitutions" even if the residues are in 

different groups (e.g., replacement of phenylalanine with the smaller isoleucine). The term 
"conservative amino acid substitution" also refers to the use of amino acid analogs. 

Methods for making amino acid substitutions, additions or deletions are well known 
in the art. The terms "conservative substitution", ^'non-conservative substitutions", "non- 
15 polar amino acids", "polar anodno acids", and "acidic amino acids" are all used consistently 
with the prior art terminology. Each of these terms is well-known in the art and has been 
extensively described in numerous publications, including standard biochemistry text books, 
such as "Biochemistry" by Geoffrey Zubay, Addison-Wesley Publishing Co., 1986 edition, 
which describes conservative and non-conservative substitutions, and properties of amino 
20 acids which lead to their definition as polar, non-polar or acidic. 

Even when it is difficult to predict the exact effect of a substitution in advance of 
doing so, one skilled in the art will appreciate that the effect can be evaluated by routine 
screening assays, preferably the biological assays described herein. Modifications of peptide 
properties including thermal stabiUty, enzymatic activity, hydrophobicity, susceptibihty to 
25 proteolytic degradation or the tendency to aggregate with carriers or into multimers are 
assayed by methods well known to the ordinarily skilled artisan. For additional detailed 
description of protein chemistry and structure, see Schulz, G. E. et al.. Principles of Protein 
Stmcture, Springer- Verlag, New York, 1979, and Creighton, T. E., Proteins: Structure and 
Molecular Principles, W. H, Freeman & Co., San Francisco, 1984. 
30 Additionally, some of the amino acid substitutions are non-conservative substitutions. 

In certain embodiments where the substitution is remote from the active or binding sites, the 
non-conservative substitutions are easily tolerated provided that they preserve a tertiary 
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structure characteristic of, or similar to, native 2-0 sulfatase, thereby preserving the active 
aad binding sites. Non-conservative substitutions, such as between, rather than within, the 
above groups (or two other amino acid groups not shown above), which will diflFer more 
significantly in their effect on maintaining (a) the structure of the peptide backbone in the 
5 area of the substitution (b) the charge or hydrophobicity of the molecule at the target site, or 
(c) the bulk of the side chain. 

Nearly every active, recombinantly expressed sulfatase reported in the hterature 
possesses a cysteine (and not a serine) within the active site sequence C/SXPXRXXXXS/TG 
(SEQ. ED NO: 6) (Lukatela, G., Krauss, N., Theis, K., Sehner, T., Giesehnann, V., von Figura, 
10 K., and Saenger, W. (1998) Biochemistry 37(1 1), 3654-64). It seemed likely, therefore, that a 
cysteine-specific modifymg machinery functionally exists in E. coli. This idea was supported 
by our initial attempts to produce a recombinant cysteine-* serine 2-0 sulfatase variant which 
led to the production of insoluble protein when expressed in E. coli. We note that the E. coli 
genome encodes for at least three different putative sulfatase genes in addition to the atsB gene 
15 which, by homology, has been proposed to encode for this cysteine-specific modifying activity. 
All of these genes are located as a cluster within the bacterial chromosome (Kertesz, M. A. 
(^Qm)FEMSMicrohiolRev2A{l\ 135-75). It would appear, however, that the £. co/z 
sijlfatase genes are normally cryptic. At the very least, E. coli lacks the specific enzymes for 
desulfating heparin/heparan sulfate glycosaminoglycans, but the bacterium fortuitously 
20 provides the necessary enzymology to effectively modify select heterologous sulfatases such as 
the 2-0 sulfatase. Therefore, the 2-0 sulfatases as described herein can be produced 
recombinantly in E. coli. However, the recombinant production of the 2-0 sulfatases provided 
are not Umited to their expression in E, coli. The 2-0 sulfatases can also be recombinantly 
produced in other expression systems described below. 
25 The 2-0 sulfatases, can be recombinantly produced using a vector including a coding 

sequence operably joined to one or more regulatory sequences. As used herein, a coding 
sequence and regulatory sequences are said to be "operably joined" when they are covalently 
linked in snch a way as to place the expression or transcription of the coding sequence under 
the influence or control of the regulatory sequences. If it is desked that the coding sequences 
30 be translated into a functional protein the coding sequences are operably joined to regulatory 
sequences- Two DNA sequences are said to be operably joined if induction of apromoter in 
the 5* regulatory sequences results in the transcription of the coding sequence and if the nature 
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of the linkage between the two DNA sequences does not (1) result in the introduction of a 
frame-shift mutation, (2) interfere with the ability of the promoter region to direct the 
transcription of the coding sequences, or (3) interfere with the ability of the corresponding 
RNA transcript to be translated into a protein. Thus, a promoter region would be operably 
joined to a coding sequence if the promoter region were capable of effecting transcription of 
that DNA sequence such that the resulting transcript might be translated into the desired 
protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribing 
and 5* non-translating sequences involved with initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the Uke. 
Especially, such 5' non-transcribing regulatory sequences will include a promoter region 
which includes a promoter sequence for transcriptional control of the operably joined gene. 
Promoters may be constitutive or inducible. Regulatory sequences may also include 
enhancer sequences or upstream activator sequences, as desired. 

As used herein, a *'vector" may be any of a number of nucleic acids into which a 
desired sequence may be inserted by restriction and Ugation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of 
DNA although RNA vectors are also available. Vectors include, but are not limited to, 
plasmids and phagemids. A cloning vector is one which is able to rephcate in a host cell, and 
which is further characterized by one or more endonuclease restriction sites at which the 
vector may be cut in a determinable fashion and into which a desired DNA sequence may be 
ligated such that the new recombinant vector retains its abiUty to rephcate in the host cell. In 
the case of plasmids, replication of the desired sequence may occur many times as the 
plasmid increases in copy number within the host bacterium, or just a single time per host as 
the host reproduces by mitosis. In the case of phage, replication may occur actively during a 
lytic phase or passively during a lysogenic phase. An expression vector is one into which a 
desired DNA sequence may be inserted by restriction and Ugation such that it is operably 
joined to regulatory sequences and may be expressed as an RNA transcript. Vectors may 
further contain one or more marker sequences suitable for use in the identification of cells 
which have or have not been transformed or transfected with the vector. Markers include, for 
example, genes encoding proteins which increase or decrease either resistance or sensitivity 
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to antibiotics or other compoimds, genes which encode enzymes whose activities are 
detectable by standard assays known in the art (e.g., fi-galactosidase or alkaline phosphatase), 
and genes which visibly affect the phenotype of transformed or transfected cells, hosts, 
colonies or plaques. Preferred vectors are those capable of autonomous replication and 
5 expression of the structural gene products present in the DNA segments to which they are 
operably joined. 

For prokaryotic systems, plasmid vectors that contain replication sites and control 
sequences derived jfrom a species compatible with the host may be used. Examples of 
suitable plasmid vectors include pBR322, pUC18, pUC19 and the like; suitable phage or 
10 bacteriophage vectors include X.gtlO, Xgtl 1 and the Uke; and suitable virus vectors iuclude 
pMAM-neo, pKRC and the hke. Preferably, the selected vector of the present invention has 
the capacity to autonomously replicate in the selected host cell. Usefiil prokaryotic hosts 
include bacteria, in addition to E. colU Flavobacterium heparinum, Bacillus, Streptomyces, 
Pseudomonas, Salmonella, Serratia, and the like. 
15 To express the 2-0 sulfatase of the invention in a prokaryotic cell, it is desirable to 

operably join the nucleic acid sequence of a 2-0 sulfatase of the invention to a functional 
prokaryotic promoter. Such promoter may be either constitutive or, more preferably, 
regulatable (i.e., inducible or derepressible). Examples of constitutive promoters include the 
int promoter of bacteriophage X, the bla promoter of the p-lactamase gene sequence of 
20 pBR322, and the CAT promoter of the chloramphenicol acetyl transferase gene sequence of 
pPR325, and the like. Examples of inducible prokaryotic promoters include the major right 
and left promoters of bacteriophage X (Pl and Pr), the trp, recA, lacZ, lad and gal promoters 
ofE. co/z, fhea-amylase(Ulmanenetal.,J.jBac^eri^^^ 162:176-182 (1985)) and the (;-28- 
specific promoters of jB. subtilis (Oilman et al.. Gene sequence 32:1 1-20 (1984)), the 
25 promoters of the bacteriophages of Bacillus (Gryczan, In: Tlie Molecular Biology of the 
Bacilli^ Academic Press, hic, NY (1 982)), and Streptomyces promoters (W ard et al., Mol 
Gen. Genet 203:468-478 (1986)). 

Prokaryotic promoters are reviewed by Click (J. Ind. Microbiol 1 'lll-lXl (1987)); 
Cenatiempo (Biochimie 68:505-516 (1986)); and Gottesman (Ann, Rev, Genet 18:415-442 
30 (1984)). 
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Proper expression in a prokaryotic cell also requires the presence of a ribosome 
binding site upstream of the encoding sequence. Such ribosome binding sites are disclosed, 
for example, by Gold et al. (Ann, Rev. Microbiol 35:365-404 (1981)). 

Because prokaryotic cells may not produce the 2-0 sulfatase of the invention with 

5 normal eukaryotic glycosylation, expression of the 2-0 sulfatase of the invention of the 
eukaryotic hosts is useful when glycosylation is desired. Preferred eukaryotic hosts include, 
for example, yeast, fungi, insect cells, and mammalian cells, either in vivo or in tissue culture. 
Mammalian cells which may be useful as hosts include HeLa cells, cells of fibroblast origin 
such as VERO or CHO-Kl, or cells of lymphoid origin, such as the hybridoma SP2/0-AG14 

10 or the myeloma P3x63Sg8, and their derivatives. Preferred mammalian host cells include 
SP2/0 and J558L, as well as neuroblastoma cell lines such as IMR 332 that may provide 
better capacities for correct post-tr^nslational processing. Embryonic cells and mature cells 
of a transplantable organ also are useful according to some aspects of the invention. 

In addition, plant cells are also available as hosts, and control sequences compatible 

15 with plant cells are available, such as the nopahne synthase promoter and polyadenylation 
signal sequences. 

Another preferred host is an insect cell, for example in Drosophila larvae. Using 
insect cells as hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, 
Science 240:1453-1459 (1988)). Altematively, baculovirus vectors can be engineered to 

20 express large amounts of the 2-0 sulfatase of the invention in insect cells (Jasny, Science 
238:1653 (1987); Miller et al.. In: Genetic Engineering (19B6), Setlow, J.K., et al., eds., 
Plenum, Vol. 8, pp. 277-297). 

Any of a series of yeast gene sequence expression systems which incorporate 
promoter and termination elements from the genes coding for glycolytic enzymes and which 

25 are produced in large quantities when the yeast are grown in media rich in glucose may also 
be utilized. Knovra glycolytic gene sequences can also provide very efficient transcriptional 
control signals. Yeast provide substantial advantages in that they can also carry out post- 
translational peptide modifications. A number of recombinant DNA strategies exist which 
utilize strong promoter sequences and high copy number plasmids which can be utilized for 

30 production of the desired proteins in yeast. Yeast recognize leader sequences on cloned 

mammalian geno sequence products and secrete peptides bearing leader sequences (i.e., pre- 
peptides). 
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A wide variety of transcriptional and translational regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine papilloma 
virus, simian virus, or the like, where the regulatory signals are associated with a particular 
5 gene sequence which has a high level of expression. Altematively, promoters from 
mammalian expression products, such as actin, collagen, myosin, and the like, may be 
employed. Transcriptional initiation regulatory signals may be selected which allow for 
repression or activation, so that expression of the gene sequences can be modulated. Of 
interest are regulatory signals that are temperature-sensitive so that by varying the 
10 temperature, expression can be repressed or initiated, or which are subject to chemical (such 
as metabohte) regulation. 

As discussed above, expression of the 2-0 sulfatase of the invention in eukaryotic 
hosts requires the use of eukaryotic regulatory regions. Such regions will, in general, include 
a promoter region sufficient to direct the initiation of RNA synthesis. Preferred exikaryotic 
15 promoters include, for example, the promoter of the mouse metallofhionein I gene sequence 
(Hamer et al., /. Mol Appl Gen. 1:273-288 (1982)); the TK promoter of Herpes virus 
(McKnight, Cell 31:355-365 (1982)); the SV40 early promoter (Benoist et al.. Nature 
(London) 290:304-310 (1981)); the yeast gaW gene sequence promoter (Johnston et al., Proa 
Natl Acad. ScL (USA) 79:6971-6975 (1982); Silver et al.. Proa Natl Acad. Sou (USA) 
20 81:5951-5955 (1984)). 

As is widely known, translation of eukaryotic mRNA is initiated at the codon which 
encodes the first methionine. For tins reason, it is preferable to ensure that the linkage 
between a eukaryotic promoter and a DNA sequence which encodes the 2-0 sulfatase of the 
invention does not contain any interveiung codons which are capable of encoding a 
25 methionine (i.e., AUG). The presence of such codons results either in the formation of a 
fusion protein (if the AUG codon is in the same reading frame as the 2-0 sulfatase of the 
invention coding sequence) or a frame-shift mutation (if the AUG codon is not in the same 
reading frame as the 2-0 sulfatase of the invention coding sequence). 

In one embodiment, a vector is employed which is capable of integrating the desired 
30 gene sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can he selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
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marker may, for example, provide for prototrophy to aa auxotrophic host or may confer 
biocide resistance to, e.g., antibiotics, heavy metals, or the like. The selectable marker gene 
sequence can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. Additional elements may also be needed for 
5 optimal synthesis of the 2-0 sulfatase mRNA. These elements may include splice signals, as 
well as transcription promoters, enhancers, and termination signals. cDNA expression 
vectors incorporating such elements include those described by Okayama, Molea Cell Biol. 
3:280 (1983). 

In another embodiment, the introduced sequence will be incorporated into a plasmid 

10 or viral vector capable of autonomous repUcation in the recipient host. Any of a wide variety 
of vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 
may be recognized and selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular host; and whether it is 

15 desirable to be able to "shuttle" the vector between host cells of different species. Preferred 
prokaryotic vectors include plasmids such as those capable of replication in coli (such as, 
for example, pBR322, ColEl, pSClOl, pACYC 184, and tiVX). Such plasmids are, for 
example, disclosed by Sambrook, et al. {Molecular Cloning: A Laboratory Manual, second 
edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989)). 

20 Bacillus plasmids include pC194, pC221, pT127, and the like. Such plasmids are disclosed 
by Gryczan (In: The Molecular Biology of the Bacilli, Academic Press, NY (1982), pp. 307- 
329). Suitable Streptomyces plasmids include pUlOl (Kendall et al., 1 Bactenol 169:4177- 
4183 (1987)), and streptomyces bacteriophages such as (j)C31 (Chater et al., In: Sixth 
International Symposium on Actinomycetales Biology, Akademiai Kaido, Budapest, Hungary 

25 (1986), pp. 45-54). Pseudomonas plasmids are reviewed by John et al. {Rev. Infect Dis. 
8:693-704 (1986)), and Izaki (Jpn. 1 Bactenol 33:729-742 (1978)). 

Preferred eukaryotic plasmids include, for example, BPV, EBV, SV40, 2-micron 
circle, and the like, or their derivatives. Such plasmids are well known in the art (Botstein et 
al,, Miami Wntr. Symp, 19:265-274 (1982); Broach, In: Tlte Molecular Biology of the Yeast 

30 Saccharomyces: Life Cycle and Inheritance, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, p. 445-470 (1981); Broach, Cell 28:203-204 (1982); Bollon et al., J. Clin. 
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Hematol Oncol 10:39-48 (1980); Maniatis, In: Cell Biology: A Comprehensive Treatise, 
Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 563-608 (1980)). Other 
preferred eukaryotic vectors are viral vectors. For example, and not by way of limitation, the 
pox virus, herpes virus, adenovirus and various retro viiruses may be employed. The viral 
vectors may include either DNA or KNA viruses to cause expression of the insert DNA or 
insert KNA. 

Once the vector or DNA sequence containing the construct(s) has been prepared for 
expression, the DNA constmct(s) may be introduced into an appropriate host cell by any of a 
variety of suitable means, i.e., transformation, transfection, conjugation, protoplast fiision, 
electroporation, calcium phosphate-precipitation, direct microinjection, and the like. 
Additionally, DNA or RNA encoding the 2-0 sulfatase of the invention may be directly 
injected into cells or may be impelled through cell membranes after being adhered to 
microparticles. After the introduction of the vector, recipient cells are grown in a selective 
medium, which selects for the growth of vector-containing cells. Expression of the cloned 
gene sequence(s) results in the production of the 2-0 sulfatase of the invention. This can take 
place in the transformed cells as such, or following the induction of these cells to differentiate 
(for example, by administration of bromodeoxyuracil to neuroblastoma cells or the like). 

One of skill in the art may also substitute appropriate codons to produce the desired 
amino acid substitutions in SEQ ID NOs: 2 or 4 by standard site-directed mutagenesis 
techniques. One may also use any sequence which differs from the nucleic acid equivalents of 
SEQ ID NO: 2 or 4 only due to the degeneracy of the genetic code as the starting point for site 
directed mutagenesis. The mutated nucleic acid sequence may then be ligated into an 
appropriate expression vector and expressed in a host such as E, colL 

Our initial assessment of 2-0 sulfatase activity was based upon the use of a few select 
unsaturated heparin disaccharide substrates. Desulfation was imequivocally specijSc for the 
2-0 position (Fig. 5). This substrate discrimination was based on the extent of sulfation and 
largely manifested as a Km effect. In particular, the presence of a 6-0 sulfate on the adjoining 
glucosamine conferred a significantly lower Km relative to its counterpart lacking such a 
sulfate ester. In terms of catalytic efficiency, the trisulfated disaccharide CAU2sHns,6s) was 
the more efficient substrate whereas the mono-sulfated disaccharide (AU2sHnac) was less 
efficient 
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The 2-0 sulfated chondroitin disaccharide AU2sGal>,Ac,6s, however, was also, albeit 
neglibly, hydrolyzed under the same kinetic conditions. The enzyme did desulfate this 
disaccharide to an appreciable extent, however, under reaction conditions involving a 4X 
higher enzyme concentration and a longer incubation time. Under these conditions, 
5 approximately 40% of the substrate was desulfated over a 20 minute period. In contrast, less 
than 10% of chondroitin disaccharide AU2sGalNAc,4s was hydrolyzed during the same time 
period. Under exhaustive conditions, both chondroitin disaccharides were greater than 95% 
desulfated at the 2-0 position. The appdxent kinetic discrimination points to an underlyiog 
structural determinant, namely a preference for glucosamine sulfated at the 6-OH and 2N 
10 positions. 

In addition, examination of the biochemical conditions for optimal enzymatic activity 
yielded several observations. First, 2-0 sulfatase activity exhibited a pH profile with a 
narrower pH range (6.0-7.0) in which the enzyme was most active. The enzyme exhibited 
maxhnal catalytic efficiency at pH 6.5 with essentially no activity observed at the outlying pH 

15 values of 5 and 8 (Fig. 6, Panel (A)). A sharply defined pH optima of 6.5 implicates a catalytic 
role of one or more histidines. Second, the observed NaCl titration profile (Fig. 6, Panel (B)) 
demonstrates a clearly inhibitory effect of ionic strength on sulfatase activity, even at relatively 
low NaCl concentrations. That is, while 50% inhibition occurred in the presence of 
approximately 200 mM NaCl, even 100 mM NaCl was shghtiy inhibitory to 2-0 sulfatase 

20 activity. This is a rather sharp activity transition for both the A 4,5 glycuronidase and other 
recombinantly expressed K heparinum GAG degrading enzymes. The correlation between 
activity and the ionic buffer composition is reasonable, given the anionic character of the 
^ saccharide substrates conferred by both the presence of sulfates and the uronic acid 
carboxylates within each disaccharide unit. For the 2-0 sulfatase in particular, charge 

25 interactions between basic side chains and the sulfate oxygen anion may be involved in 
substrate orientation. 

The results described herein suggest that the 2-0 sulfatase activity is upstream fi-om the 
hydrolysis of the unsaturated uronic acid by the A 4,5 glycuronidase. This scenario would also 
make the 2-0 sulfatase a so-called "early" enzyme in the HSGAG degradation pathway that 
30 occurs in ^nvo. The substrate-product correlation between the 2-0 sulfatase and the A 4,5 
glycuronidase has been demonstrated with the two experiments summarized in Figs. 7 and 8. 
Fig. 8 in particular demonstrates how these two enzymes (along with the heparinases) can be 
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nsed in tandem as analytical tools for HSGAG compositional analyses. The results have 
demonstrated the utility of the sulfatase as a tool for probing HSGAG composition, especially 
when the enzyme is used in tandem with the A 4,5 glycuronidase. 

The present invention provides for the use of 2-0 sulfatase as an enzymatic tool due 
5 to its substrate specificity and specific activity. As described herein, it was found that the 
activity of the cloned enzyme is not compromised by its recombinant expression in E. colL 
The ^'native 2-0 sulfatase specific activity* is the measure of enzymatic activity of the native 
2-0 sulfatase obtained firom cell lysates of F. heparinum also described in the Examples 
below. Therefore, based on the disclosure provided herein, those of ordinary skill in the art 

10 will be able to identify other 2-0 sulfatases having altered enzymatic activity with respect to 
the native 2-0 sulfatase such as functional variants. 

The term "specific activity* as used herein refers to the enzymatic activity of a 
preparation of 2-0 sulfatase. In general, it is preferred that the substantially pure and/or 
isolated 2-0 sulfatase preparations of the invention have a specific activity of at least about 7 

15 nanomoles of substrate (DiS) hydrolized per minute per microgram of enzyme. It also 

generally more preferred that the substantially pure and/or isolated 2-0 sulfatase preparations 
of the invention have a speciac activity of at least about 40 nanomoles of substrate piS) 
hydrolized per minute per microgram of enzyme. As provided herein, the recombinant 2-0 
sulfatase purified by (nickel chromatography with the histidine tag) was found to have an 

20 about six-fold higher specific activity than native 2-0 sulfatase. The recombinant 2-0 

sulfatase without the histidine tag was found to have an about ten-fold higher specific activity 
than the native 2-0 sulfatase. Therefore, in one aspect of the hivention preparations of 2-0 
sulfatase with about a 5-, 6-, 7-, 8-, 9-, 10-, 1 1-, 1 2-, 13-, 14-, 1 5-, 20-, 25-, and 30- fold 
specific activity are provided. 

25 The invention, therefore, provides for the degradation of glycosammoglycans using 

the 2-0 sulfatase described herein. The 2-0 sulfatase of the invention may be used to 
specifically degrade an HSGAG by contacting the HSGAG substrate witii the 2-0 sulfatase 
of the invention. The invention is useful in a variety of z>z vitro, in vivo and ex yivo methods 
in which it is useful to degrade HSGAGs. 

30 As used herein the terms 'TISGAG" and "glycosammoglycan** and "GAG" are used 

interchangeably to refer to a family of molecules having heparin-like/heparan suhfate-like 
structures and properties. These molecules include but are not hmited to low molecular 
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weight heparin (LMWH), heparin, biotechnologically prepared heparin, chemically modified 
heparin, synthetic heparin, and heparan sulfate. The term "biotechnological heparin" 
encompasses heparin that is prepared from natural sources of polysaccharides which have 
been chemically modified and is described for example in Razi et al., Bioche. J. 1995 Jul 
5 15;309 (Pt 2): 465-72. Chemically modified heparin is described in Yates et al., 
Carbohydrate Res (1996) Nov 20;294:15-27, and is known to those of skill in the art. 
Synthetic heparin is well known to those of skill in the art and is described in Petitou, M. et 
al., Bioorg Med Chem Lett. (1999) Apr 19;9(8):1161-6. 

Analysis of a sample of glycosaminoglycans is also possible with 2-0 sulfatase alone 

10 or in conjunction with other enzymes. Other HSGAG degrading enzymes include but are not 
limited to heparinase-I, heparinase- n , heparinase-IE, A 4, 5 glycuronidase, other sulfatases, 
modified versions of the enzymes, variants and fimctionally active fragments thereof. In 
particular, 2-0 sulfatase can be used subsequent to or concomitantly with a heparinase to 
degrade a glycosamiaoglycan. In addition 2-0 sulfatase may be used prior to and also 

15 concomitantly with A 4, 5 glycuronidase. 

The methods that may be used to test the specific activity of 2-0 sulfatase of the 
present invention are known in the art, e.g., those described in the Examples. These methods 
may also be used to assess the function of variants and fimctionally active Segments of 2-0 
sulfatase. The kcat value may be determined using any enzymatic activity assay to assess the 

20 activity of a 2-0 sulfatase enzyme. Several such assays are well-known in the art. For 

iostance, an assay for measuring kcat is described in Ernst, S. E., Venkataraman, G., Winkler, 
S., Godavarti, R., Langer, IL, Cooney, C. and Sasisekharan. R. (1996) Biochem. J. 315, 589- 
597. Therefore, based on the disclosure provided herein, those of ordinary skill in the art will 
be able to identify other 2-0 sulfatase molecules having enzymatic activity that is similar to 

25 or altered ia comparison witii the native 2-0 sulfatase molecule such as 2-0 sulfatase 
fimctional variants. 

Due to the activity of 2-0 sulfatase on glycosaminoglycans, the product profile 
produced by a 2-0 sulfatase may be determined by any method known in the art for 
examining the type or quantity of degradation products produced by 2-0 sulfatase alone or in 
30 combination with other enzymes. One of skill in the art will also recognize that the 2-0 
sulfatase may also be used to assess the purity of glycosaminoglycans in a sample. One 
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preferred method for determining the type and quantity of product is described in Rhomberg, 
AJ. et aL, PNAS, v. 95, p. 4176-4181, (April 1998), which is hereby incorporated in its 
entirety by reference. The method disclosed in the Rhomberg reference utilizes a 
combination of mass spectrometry and capillary electrophoretic techniques to identify the 
5 emymatic products produced by heparinase. The Rhomberg study utilizes heparinase to 
degrade HSGAGs to produce HSGAG oligosaccharides. MALDI (Matrix-Assisted Laser 
Desorption Ionization) mass spectrometry can be used for the identification and 
semiquantitative measurement of substrates, enzymes, and end products in the enzymatic 
reaction. The capillary electrophoresis technique separates the products to resolve even small 

10 differences amongst the products and is applied in combination with mass spectrometry to 
quantitate the products produced. Capillaiy electrophoresis may even resolve the difference 
between a disaccharide and its semicarbazone derivative. Detailed methods for sequencing 
polysaccharides and other polymers are disclosed in co-pending U.S. Patent Applications 
Serial Nos. 09/557,997 and 09/558,137, both filed on April 24, 20OO and having common 

15 inventorship. The entire contents of both apphcations are hereby incorporated by reference. 
Briefly, the method is performed by enzymatic digestion, followed by mass 
spectrometry and capillary electrophoresis. The enzymatic assays can be performed in a 
variety of manners, as long as the assays axe performed identically on the HSGAG, so that the 
results may be compared. In the example described in the Rhomberg reference, enzymatic 

20 reactions are performed by adding 1 mL of enzyme solution to 5 mL of substrate solution. 
The digestion is then carried out at room temperature (22''C), and the reaction is stopped at 
various time points by removing .0.5 mJL of the reaction mixture and adding it to 4.5 mL of a 
MALDI matrix solution, such as caffeic acid (approximately 12 mg/mL) and 70% 
acetonitrile/water. The reaction mixture is then subjected to MALDI mass spectrometry. 

25 The MALDI surface is prepared by the method of Xiang and Beavis pCiang and Beavis 
{199^) Rapid. Commun, Mass. Spectrom. 8, 199-204). A two-fold lower access of basic 
peptide (Arg/Gly)i5 is premixed with matrix before being added to the oKgosaccharide 
solution. A 1 mL aliquot of sample/matrix mixture containing 1-3 picomoles of 
oligosaccharide is deposited on the surface. After crystallization occurs (typically within 60 

30 seconds), excess liquid is rinsed off with water. MALDI mass spectrometry spectra is then 
acquired in the Hnear mode by usmg a PerSeptive Biosystems (Framingham, MA) Voyager 
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Elite reflectron time-of-flight instrument fitted with a 337 nanometer nitrogen laser. Delayed 
extraction is used to increase resolution (22 kV, grid at 93%, guidewire at 0.15%, piilse delay 
150 ns, low mass gate at 1,000, 128 shots averaged). Mass spectra may be caUbrated 
externally by using the signals for proteinated (Arg/Gly)i5 and its complex with the 
5 oligosaccharide. 

Capillary electrophoresis may then be performed on a Hewlett-Packard^^ CE unit by 
xising uncoated fused silica c^illaries (internal diameter 75 micrometers, outer diameter 363 
micrometers, Idet 72.1 cm, and Itot 85 cm). Analytes are monitored by using UV detection at 
230 nm and an extended light path cell (Hewlett-Packard). The electrolyte is a solution of 10 

10 mL dextran sulfate and 50 millimolar Tris/phosphoric acid (pH2.5). Dextran sulfate is used 
to suppress nonspecific interactions of the heparin ohgosaccharides with a siUca wall. 
Separations are carried out at 30 kV with the anode at the detector side (reversed polarity). A 
mixture of a 1/5-naphtalenedisulfonic acid and 2-naphtalenesulfonic acid (10 micromolar 
each) is used as an internal standard. 

15 Other methods for assessing the product profile may also be utilized. For instance, 

other metihiods include methods which rely on parameters such as viscosity (Jandik, K. A., Gu, 
K. and Linhardt, R.J., (1994), Glycobiology, 4:284-296) or total UV absorbance (Ernst, S. et 
al., (1996), Biochem, J., 315:589-597) or mass spectrometry or capillary electrophoresis 
alone. 

20 The 2-0 sulfatase molecules of the invention are also usefiil as tools for sequencing 

HSGAGs. Detailed methods for sequencing polysaccharides and other polymers are 
disclosed in co-pending U.S. Patent AppHcations Serial Nos. 09/557,997 and 09/558,137, 
both filed on April 24, 2000 and having common inventorship. These methods utilize tools 
such as heparinases in the sequencing process. The 2-0 sulfatase of the invention is useful as 

25 such a tool. 

2-0 sulfatase as well as the combinations of 2-0 sulfatase with other enzymes can, 
therefore, be used in any method of analyzing HSGAGs. In addition, these enzymes as 
described can be used to determine the presence of a particular glycosaminoglycan in a 
sample or the conaposition of a glycosaminoglycans in a sample. A "sample", as used herein, 
30 refers to any sample that may contain a GAG. 

One of ordinary skill in the art, in fight of the present disclosure, is enabled to produce 
substantially pure preparations of HSGAG and/or GAG firagment compositions utilizing the 
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2-0 sulfatase molecules alone or in conjunction with other enzymes. The GAG fragment 
preparations are prepared from HSGAG sources. A ''HSGAG source" as used herein refers 
to heparin-like/heparan sulfate-like glycosaminoglycan composition which can be 
manipulated to produce GAG fragments using standard technology, including enzymatic 

5 degradation etc. As described ahove, HSGAGs include but are not limited to isolated 
heparin, chemically modified heparin, biotechnology prepared heparin, synthetic heparin, 
heparan sulfate, and LMWH. Thus HSGAGs can be isolated from natural sources, prepared 
by direct synthesis, mutagenesis, etc. 

The 2-0 sulfatase is, in some embodiments, immobilized on a support. The 2-0 

10 sulfatase may be immobilized to any type of support but if the support is to be used in vivo or 
ex vivo it is desired that the support is sterile and biocompatible. A biocompatible support is 
one which would not cause an immune or other type of damaging reaction when used in a 
subject. The 2-0 sulfatase may be immobilized by any method known in the art. Many 
methods are known for immobilizing proteius to supports. A "sohd support" as used herera 

15 refers to any sohd material to which a polypeptide can he immobilized. 

Solid supports, for example, include but are not limited to membranes, e.g., natural 
and modified celluloses such as nitrocellulose or nylon, Sepharose, Agarose, glass, 
polystyrene, polypropylene, polyethylene, dextran, amylases, polyacrylamides, 
polyvinylidene difluoride, other agaroses, and magnetite, including magnetic beads. The 

20 carrier can be totally iasoluble or partially soluble and may have any possible structural 
configuration. Thus, the support may be spherical, as in a bead, or cylindrical, as in the 
inside surface of a test tube or microplate well, or the external surface of a rod. Altematively, 
the surface maybe flat such as a sheet, test strip, bottom surface of a microplate weU, etc. 
The 2-0 sulfatase of the invention may also be used to remove active GAGrs firom a 

25 GAG containing fluid. A GAG containing fluid is contacted Avith the 2-0 sulfatase of the 
invention to degrade the GAG. The method is particularly useful for the ex vivo removal of 
GAGs fiom blood. In one embodiment of the invention the 2-0 sulfatase is immobilized on a 
sohd support as is conventional in the art. The sohd support containing the iimnobihzed 2-0 
sulfatase maybe used in extracorporeal medical devices (e.g. hemodialyzer, 

30 pump-oxygenator) for systemic heparinization to prevent the blood in the device from 

clotting. The support membrane containing immobilized 2-0 sulfatase is positioned at the 
end of the device to neutralize the GAG before the blood is returned to the body. 
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2-0 sulfatase and the resulting GAG fragments also have maay ther^eutic utilities. 
A "therapeutic GAG fragment" as used herein refers to a molecule or molecules which are 
degraded GAGs or pieces or fragments thereof that have been degraded through the use of 
the 2-0 sulfatase possibly along with other GAG - degrading enzymes, (e.g. native and/or 
modified heparinases). Such compounds may be generated using 2-0 sulfatase to produce 
therapeutic fragments or they may be synthesized de novo. Putative GAG fragments can be 
tested for therapeutic activity using any of the assays described herein or known in the art. 
Thus the therapeutic GAG fragment may be a synthetic GAG fragment generated based on 
the sequence of the GAG fragment identified when the tumor is contacted with 2-0 sulfatase, 
or having minor variations which do not interfere with the activity of the compound. 
Alternatively the therq>eutic GAG fragment may be an isolated GAG fragment produced 
when the tumor is contacted with 2-0 sulfatase. 

The 2-0 sulfatase and/or GAG fragments can be used for the treatment of any type of 
condition in which GAG fragment therapy has been identified as a useful therapy, such as 
preventing coagulation, inhibiting angiogenesis, preventing neovascularization, inhibiting 
proliferation, regulating ^optosis, etc. The methods of the invention also enable one of skill 
in the art to prepare or identify an appropriate composition of GAG fragments, depending on 
the subject and the disorder being treated. These compositions of GAG fragments may be 
used alone or in combination with the 2-0 sulfatase and/or other enzymes. Likewise 2-0 
sulfatase and/or other enzymes may also be used to produce GAG fragments in vrvo. 

The iuvention is useful for treating and/or preventing any disease/condition in a 
subject whereby glycosaminoglycans have been foimd to be important in the development 
and/or progress of the disease. The terms **treaf' and "treating" as used herein refers to 
reversing or blocking the progression of the disease m the subject. Treating a disease also 
includes exacting a desired improvement in the disease or symptoms of the disease. For 
example to treat a subject with tumor cell proliferation refers to inhibiting completely or 
partially the proliferation or metastasis of a cancer or tumor cell, as weU as inhibiting or 
preventing any increase in the proliferation or metastasis of a cancer or tumor cell. 

A "subject having a disease" is a subject that can be diagnosed as having the disease, 
e.g., a person having cancer is identified by the presence of cancerous cells. A "subject at 
risk of having a disease" as used herein is a subject who has a high probabiUty of developing 
the disease. These subjects include, for instance, subjects having a genetic abnormality, the 
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presence of which has been demonstrated to have a correlative relation to a higher likelihood 
of developing the disease. For diseases brought about by exposure to disease causing agents, 
subjects at risk are those who are exposed to the disease causing agents such as tobacco, 
asbestos, chemical toxins, viruses, parasites, etc. A subject at risk also includes those who 
have previously been treated for the disease and have the possibility of having a recurrence of 
the disease. When a subject at risk of developing a disease is treated with a 2-0 sulfatase, a 
cocktail of 2-0 sulfatase along with other GAG - degrading enzymes (e.g. heparinase and 
A4, 5 glycuronidase) or degradation products thereof the subject is able to prevent the 
occurrence of the disease or reduce the possibility of developing the disease. 

The compositions of the invention, therefore, can be used for the treatment of any 
type of condition in which GAG fragment ther^y has been identified as a useful therapy. 
Thus, the invention is useful in a variety of i?t yitro, in vivo and ex vivo methods in which 
therapies are useful. For instance, GAG fragments can also be useful for treating or 
preventing cancer, atherosclerosis, neurodegenerative disease (eg. Alzheuner's), microbial 
infection, psoriasis, etc. GAG fragments can also be useful in tissue repair. The GAG 
fragment compositions may also be used in in vitro assays, such as a quality control sample. 

Each of these disorders mentioned herein is well-known in the art and is described, 
for instance, in Harrison 's Principles of Internal Medicine (McGraw Hill, hic, ISfew York), 
which is incorporated by reference. 

In one embodiment the preparations of the invention are used for mhibiting 
angiogenesis. An effective amount for inhibiting angiogenesis of the GAG fragment 
preparation is administered to a subject in need of treatment thereof. Angiogenesis as used 
herein is the inappropriate formation of new blood vessels. "Angiogenesis" often occurs in 
tumors when endothelial cells secrete a group of growth factors that are mitogenic for 
endothelium causing the elongation and proliferation of endotheUal cells which results in a 
generation of new blood vessels. Several of the angiogenic mitogens are heparin buiding 
peptides which are related to endothelial cell growth factors. The inhibition of angiogenesis 
can cause tumor regression in animal models, suggesting a use as a therapeutic anticancer 
agent. An effective amount for mhibiting angiogenesis is an amount of GAG fragment 
preparation which is sufficient to diminish the number of blood vessels growing into a tumor. 
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This amount can be assessed in an animal model of tumors and angiogenesis, many of which 
are known in the art. 

Thus, the 2-0 sulfatase molecules are useful for treating or preventing disorders 
associated with coagulation. A "disease associated with coagulation" as used herein refers to 
a condition characterized by an interruption in the blood supply to a tissue due to a blockage 
of the blood vessel responsible for supplying blood to the tissue such as is seen for 
myocardial or cerebral infarction. A cerebral ischemic attack or cerebral ischemia is a form 
of ischemic condition in which the blood supply to the brain is blocked. This interruption in 
the blood supply to the brain may result from a variety of causes, including an intrinsic 
blockage or occlusion of the blood vessel itself, a remotely originated source of occlusion, 
decreased perfusion pressure or increased blood viscosity resulting in inadequate cerebral 
blood flow, or a ruptured blood vessel in the subarachnoid space or intracerebral tissue. 

A "disease associated with coagulation" as used herein also is intended to encompass 
atherosclerosis. Atherosclerosisis a disease of the arteries whereby blood flow can be 
reduced due to the development of atheromatous plaques along the interior walls of the 
arteries. These plaques begin by the initial deposition of cholesterol crystals which grow 
larger with time. In addition to the cholesterol deposition, plaques also grow due to the 
proliferation of the surroxmding cells, hi time, the artery may become completely occluded 
due to this plaque growth. 

The 2-0 sulfatase or the GAG fragments generated therewith may be used alone or iu 
combination with a therapeutic agent for treating a disease associated with coagulation. 
Examples of therapeutics useful in the treatment of diseases associated with coagulation 
include anticoagulation agents, antiplatelet agents, and thrombolytic agents. 

Anticoagulation agents prevent the coagulation of blood components and thus prevent 
clot formation. Anticoagulants include, but are not limited to, heparin, warfarin, Coumadin, 
dicumarol, phenprocoumon, acenocoumarol, ethyl biscoumacetate, and indandione 
derivatives. 

Antiplatelet agents inhibit platelet aggregation and are often used to prevent 
thromboembolic stroke in patients who have experienced a transient ischemic attack or 
stroke. Antiplatelet agents include, but are not limited to, aspirin, thienopyridine derivatives 
such as ticlopodine and clopidogrel, dipyridamole and sulfinpyrazone, as well as RGD 
mimetics and also antithrombin agents such as, but not limited to, hirudin. 
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Thrombolytic agents lyse clots which cause the thromboembolic stroke. 
Thrombolytic agents have been used in the treatment of acute venous thromboembolism and 
pulmonary emboli and are well known in the art (e.g. see Hennekens et al, J Am Coll Cardiol; 
V. 25 (7 supp), p. 18S-22S (1995); Hohnes, et al, J Am Coll Cardiol; v.25 (7 suppl), p. lOS- 

5 178(1995)). Thrombolytic agents include, but are not limited to, plasminogen, aa- 

antiplasmin, streptokinase, antistreplase, tissue plasminogen activator (tPA), and urokinase. 
*tPA" as used herein includes native tPA and recombinant tPA, as well as modified fonns of 
tPA that retain the enzymatic or fibrinolytic activities of native tPA. The enzymatic activity 
of tPA can be measured by assessing the ability of the molecule to convert plasminogen to 

10 plasmin. The fibrinolytic activity of tPA may be determined by any in vitro clot lysis activity 
known in the art, such as the purified clot lysis assay described by Carlson, et. al., Anal 
Biochem, 168, 428-435 (1988) and its modified form described by Bennett, W. F. et al., 1991,. 
J. Biol. Chem. 266(8):5 19 1-5201, the entire contents of which are hereby incorporated by 
reference. 

15 The compositions as described herein can also be used to prevent or treat 

'^neurodegenerative disease" is defined herein as a disease in which progressive loss of 
neurons occurs either in the peripheral nervous system or in the central nervous system. 
Examples of neurodegenerative disorders include famihal and sporadic amyotrophic lateral 
sclerosis (FALS and ALS, respectively), familial and sporadic Parkinson's disease, 

20 Huntington's disease, famihal and sporadic Alzheimer's disease, multiple sclerosis, 
ohvopontocerebeliar atrophy, multiple system atrophy, progressive supranuclear palsy, 
diffixse Lewy body disease, corticodentatonigral degeneration, progressive familial myoclonic 
epilepsy, strionigral degeneration, torsion dystonia, familial tremor, Down's Syndrome, 
Gilles de la Tourette syndrome, Hallervorden-Spatz disease, diabetic peripheral neuropathy, 

25 dementia pugiUstica, AIDS dementia, age related dementia, age associated memory 

impairment, amyloidosis-related neurodegenerative diseases such as those caused by the 
prion protein (PrP) which is associated with transmissible spongiform encephalopathy 
(Creutzfeldt-Jakob disease, Gerstmann-Straussler-Scheinker syndrome, scrapie, bovme 
spongiform encephalopathy and kuru), and those caused by excess cystatin C accumulation 

30 (hereditary cystatin C angiopathy), traumatic brain mjury (e.g., surgery-related bram injury), 
cerebral edema, peripheral nerve damage, spinal cord injury, Wemicke-KorsakofPs related 
dementia (alcohol induced dementia), and presenile dementia. The foregomg examples are 
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not meant to be comprehensive but serve merely as an illustration of the term 
"neurodegenerative disease". 

The invention also provides treatment or prevention of a neurodegenerative disease by 
the administration of the 2-0 sulfatase and/or GAG fragment compositions described herein 
5 possibly in conjunction with other ther^eutic agents for the particular condition being 
treated. The administration the other therapeutics may be performed concomitantly, 
sequentially or at different time points. 

For example, when treating Alzheimer's Disease, the therapeutic agents which can be 
combined with the compositions of the invention include, but are not limited to, estrogen, 
10 vitamin E (alpha-tocopherol). Tacrine (tetrahydroacridinamine), selegiline (deprenyl), and 
Aracept (donepezil). One of ordinary skill in the art will be familiar with additional 
therapeutic agents useful for the treatment of neurodegenerative diseases. 

Critically, HSGAGs (along with collagen) are key components of the cell surface- 
extracellular matrix (ECM) interface. While coUagen-Uke proteins provide the necessary 
15 extracellular scaflFold for cells to attach and form tissues, the complex pol>^accharides fill the 
space created by the scaffold and act as a molecular sponge by specifically binding and 
regulating the biological activities of numerous signaling molecules like growth factors, 
cytokines etc. Therefore, the compositions provided herein can also be used in methods of 
repairing tissues. 

20 In addition, as it had been found that viruses and parasites utilize glycosaminoglycans 

such as heparan sulfate as receptors to infect target cells (Liu, J., and Thoip, S. C. (2002) Med 
Res Rev 22(1), 1-25), the compositions of the invention may also be used to treat or prevent 
microbial infections. The compositions of the invention can also be administered in 
combination with other antiviral agents or antiparasitic agents. 

25 Antiviral agents are compounds which prevent infection of cells by vimses or 

replication of the virus within the cell. There are several stages within the process of viral 
infection which can be blocked or inhibited by antiviral agents. These stages include, 
attachment of the virus to the host cell (immunoglobuUn or binding peptides), uncoating of 
the virus (e.g., amantadine), synthesis or translation of viral mRNA (e.g., interferon), 

30 replication of viral RNA or DNA (e.g., nucleoside analogues), maturation of new virus 
proteins (e.g., protease inhibitors), and budding and release of the virus. 



^.WOZ0Q4/0625S2. 



^ PCT/L'S2i. '.i/(BGI3fUS2004/000332 



-46- 

Examples of antiviral agents known in the art are nucleotide analogues which include, 
but are not limited to, acyclovir (used for the treatment of herpes simplex vims and varicella- 
zoster virus), gancyclovir (usefiil for the treatment of cytomegalovirus), idoxuridine, ribavirin 
(useful for the treatment of respiratory syncitial virus), dideoxyinosine, dideoxycytidine, and 
5 zidovudine (azidothymidine). 

It has also been recently been recognized that cells synthesize distinct HSGAG 
sequences and decorate themselves with these sequences, using the extraordinary information 
content present in the sequences to bind specifically to many signaling molecules and thereby 
regulate various biological processes. The processes include apoptosis (Ishikawa, Y., and 
10 Kitamnra, M. (1999) Kidney Int 56(3), 954-63, Kapila, Y. L., Wang, S., Dazin, P., TafoUa, 
E., and Mass, M. J. (2002) J Biol Chem 277(10), 8482-91). Regulation of apoptosis with the 
compositions of the invention can prove important to a variety of diseases whereby an 
increase or decrease in cell death is warranted. Apoptosis is known to play a role in 
numerous physiologic and pathologic events such as embryogenesis and metamorphosis, 
15 homaone-dependent involution in the adult, cell death in tumors, atrophy of some organs and 
tissues, etc. 

As the compositions of the invention are usefiil for the same purposes as heparinases 
and the degradation products of heparinases (HSGAG fragments), they are also useful for 
treating and preventing cancer cell proliferation and metastasis. Thus, according to another 

20 aspect of the invention, there is provided methods for treating subjects having or at risk of 
having cancer. The cancer may be a malignant or non-malignant cancer. Cancers or tumors 
include but are not limited to biliary tract cancer; brain cancer, breast cancer; cervical cancer; 
choriocarcinoma; colon cancer; endometrial cancer; esophageal cancer; gastric cancer; 
intraepithelial neoplasms; lymphomas; liver cancer; lung cancer (e.g. small cell and 

25 non-small cell); melanoma; neuroblastomas; oral cancer; ovarian cancer; pancreas cancer; 
prostate cancer; rectal cancer; sarcomas; skin cancer; testicular cancer; thyroid cancer; and 
renal cancer, as well as other carcinomas and sarcomas. 

The invention also encompasses screening assays for identifying therapeutic GAG 
fragments for the treatment of a tumor and for preventing metastasis. The assays are 

30 accomplished by treating a tumor or isolated tumor cells with 2-0 sulfatase and/or other 
native or modified heparinases and isolating the resultant GAG fragments. Surprisingly, 
these GAG fragments have therapeutic activity in the prevention of tumor cell proliferation 
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and metastasis. Thus the invention encompasses individualized therapies, in which a tumor 
or portion of a tumor is isolated jfrom a subject and used to prepare the therapeutic GAG 
fragments. These therapeutic fragments can be re-administered to the subject to protect the 
subject from further tumor cell proliferation or metastasis or from the initiation of metastasis 
5 if the tumor is not yet metastatic. Alternatively the fingments can be used in a different 
subject having the same type or tumor or a different type of tumor. 

The invasion and metastasis of cancer is a complex process which involves changes in 
cell adhesion properties which allow a transformed cell to invade and migrate through the 
extracellular matrix (ECM) and acquire anchorage-independent growth properties (Liotta, L. 

10 A., et al. Cell 64:327-336, 1991). Some of these changes occur at focal adhesions, which are 

cell/ECM contact points containing membrane-associated, cytoskeletal, and intracellular ^ 
signaUng molecules. Metastatic disease occurs when the disseminated foci of tumor cells 
seed a tissue which supports their growth and propagation, and this secondary spread of 
tumor cells is responsible for the morbidity and mortaUty associated with the majority of 

15 cancers. Thus the term "metastasis" as used herein refers to the invasion and migration of 
tumor cells away from the primary tumor site. 

The barrier for the tumor cells may be an artificial barrier in vitro or a natural barrier 
in vivo. In vitro barriers include but are not limited to extracellular matrix coated 
membranes, such as Matrigel. Thus the 2-0 sulfatase compositions or degradation products 

20 thereof can be tested for their ability to inhibit tumor cell invasion in a Matrigel invasion 
assay system as described in detail by Parish, C.R., et al., "A Basement-Membrane 
PermeabiUty Assay which Correlates with the Metastatic Potential of Tumour Cells," Int. J. 
Cancer, 1992, 52:378-383. Matrigel is a reconstituted basement membrane containing type 
IV collagen, laminin, heparan sulfate proteoglycans such as perlecan, which bind to and 

25 localize bFGF, vitronectin as well as transforming growth factor- p (TGF-p), urokinase-type 
plasminogen activator (uPA), tissue plasminogen activator (tPA), and the seipin known as 
plasminogen activator inhibitor type 1 (PAI-1). Other in vitro and in vivo assays for 
metastasis have been described in the prior art, see, e.g., U.S. Patent No. 5,935,850, issued 
on August 10, 1999, which is incorporated by reference. An in vivo barrier refers to a 

30 cellular barrier present in the body of a subj ecL 
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Effective amounts of the 2-0 sulfatase, functional variants thereof or therapeutic 
GAGs of the invention are administered to subjects in need of such treatment. Effective 
amounts are those amounts which will result in a desired improvement in the condition or 
symptoms of the condition, e.g., for cancer this is a reduction in cellular proliferation or 
metastasis, without causing other medically unacceptable side effects. Such amounts can be 
determined with no more than routine experimentation. It is beheved that doses ranging from 
1 nanogram/kilogram to 100 milligrams/kilogram, depending upon the mode of 
administration, will be effective. The absolute amount will depend upon a variety of factors 
(including whether the administration is m conjunction with other methods of treatment, the 
number of doses and individual patient parameters including age, physical condition, size and 
weight) and can be determined with routine experimentation. It is preferred generally that a 
maximum dose be used, that is, the highest safe dose according to sound medical judgment. 
The mode of administration may be any medically acceptable mode including oral, 
subcutaneous, intravenous, etc. 

In general, when administered for therapeutic pmposes, the formulations of the 
invention are ^phed in pharmaceutically acceptable solutions. Such preparations may 
routinely contain pharmaceutically acceptable concentrations of salt, buffering agents, 
preservatives, compatible carriers, adjuvants, and optionally other therapeutic ingredients. 

The compositions of the invention may be administered per se (neat) or in the form of 
a pharmaceutically acceptable salt. When used in medicine the salts should be 
pharmaceutically acceptable, but non-phannaceutically acceptable salts may conveniently be 
used to prepare pharmaceutically acceptable salts thereof and are not excluded from the scope 
of the invention. Such pharmacologically and pharmaceutically acceptable salts include, but 
are not limited to, those prepared from the foUowmg acids: hydrochloric, hydrobromic, 
sulphuric, nitric, phosphoric, maleic, acetic, sahcylic, p-toluene sulphonic, tartaric, citric, 
methane sulphonic, formic, malonic, succinic, naphthalene-2-sulphonic, and benzene 
sulphonic. Also, pharmaceutically acceptable salts can be prepared as alkaline metal or 
alkaline earth salts, such as sodium, potassium or calcium salts of the carboxyUc acid group. 

Suitable buffering agents include: acetic acid and a salt (1-2% WAO; citric acid and a 
salt (1-3% WAO; boric acid and a salt (0.5-2.5% WA'^); and phosphoric acid and a salt 
(0.8-2% W/V). Suitable preservatives include benzalkonium chloride (0.003-0.03% WAO; 
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cMorobutanoi (0.3-0.9% WAO; parabens (0.01-0.25% W/V) and thimerosal (0.004-0.02% 
WAO. 

The present invention provides pharmaceutical compositions, for medical use, which 
comprise 2-0 sulfatase, functional variants thereof or therapeutic GAG fragments together 
with one or more pharmaceutically acceptable carriers and optionally other therapeutic 
ingredients. The term "pharmaceutically-acceptable carrier" as used herein, and described 
more fully below, means one or more compatible solid or hquid JSUer, dilutants or 
encapsulating substances which are suitable for administration to a human or other animal. 
In the present invention, the tenn "carrier" denotes an organic or inorganic ingredient, natural 
or synthetic, with which the active ingredient is combmed to facilitate the appUcation. The 
components of the pharmaceutical compositions also are capable of being commingled with 
the 2-0 sulfatase of the present invention or other compositions, and with each other, in a 
maimer such that there is no interaction which would substantially impair the desired 
pharmaceutical efficiency. 

A variety of administration routes are available. The particular mode selected will 
depend, of course, upon the particular active agent selected, the particular condition being 
treated and the dosage required for therapeutic efficacy. The methods of this invention, 
generally speaking, may be practiced using any mode of administration that is medically 
acceptable, meaning any mode that produces effective levels of an immune response without 
caiisrng cUnically unacceptable adverse effects. A preferred mode of administration is a 
parenteral route. The term "parenteral" includes subcutaneous injections, intravenous, 
intramuscular, intraperitoneal, intra sternal injection or infusion techniques. Other modes of 
administration include oral, mucosal, rectal, vaginal, sublingual, intranasal, intratracheal, 
inhalation, ocular, transdermal, etc. 

For oral administration, the compoimds can be formulated readily by combining the 
active compound(s) with pharmaceutically acceptable carriers well known in the art. Such 
carriers enable the compoimds of the invention to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syn^s, slurries, suspensions and the Hke, for oral ingestion by a 
subject to be treated. Pharmaceutical preparations for oral use can be obtained as soUd 
excipient, optionally grinding a resulting mixture, and processing the mixture of graniiles, 
after adding suitable auxiUaries, if desired, to obtain tablets or dragee cores. Suitable 
excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or 
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sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, 
sodium caiboxymethylcellulose, and/or polyvinylpyrrohdone (PVP). If desired, 
disintegrating agents may be added, such as the cross-linked polyvinyl pyrrohdone, agar, or 
5 alginic acid or a salt thereof such as sodium alginate. Optionally the oral formulations may 
also be formulated in saline or buffers for neutraUzing internal acid conditions or may be 
administered without any carriers. 

Dragee cores are provided with suitable coatings. For this purpose, concentrated 
sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl 
10 pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, 
and suitable organic solvents or solvent mixtures. Dyestuifs or pigments maybe added to the 
tablets or dragee coatings for identification or to characterize different combinations of active 
compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made 

15 of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol 
or sorbitol. The p-ush-fit capsules can contain the active ingredients in admixture with filler 
such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate 
and, optionally, stabiUzers. Jn soft capsules, the active compounds maybe dissolved or 
suspended in suitable hquids, such as fatty oils, hquid paraffin, or liquid polyethylene 

20 glycols. In addition, stabiUzers may be added. Microspheres formulated for oral 

administration may also be used. Such microspheres have been well defined in the art. All 
formulations for oral administration should be in dosages suitable for such administration. 

For buccal administration, the compositions may take the form of tablets or lozenges 
formulated in conventional maimer. 

25 For administration by inhalation, the compomids for use accx)rding to the present 

invention may be conveniently delivered in the form of an aerosol spray presentation firom 
pressurized packs or a nebulizer, with the use of a suitable propellant, e.g,, 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoro ethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 

30 by providing a valve to deUver a metered amount. Capsules and cartridges of e.g. gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 



WO2004/0d2S9!2 



PCT/US2004/000332 



-51- 

The compounds, when it is desirable to deliver them systemically, may be fomiulated 
for parenteral administration by injection, by bolns injection or continuous infusion. 
Formulations for injection may be presented in unit dosage form, e,g,, in ampoules or in 
multi-dose containers, with an added preservative. The compositions may take such forms as 
5 suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabihzing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions 
of the active compounds in water-soluble form. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophihc 
10 solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as 
ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubiUty of the compounds to allow for the 
1 5 preparation of highly concentrated solutions. 

Alternatively, the active compounds may be m powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal or vaginal compositions such as 
suppositories or retention enemas, eg., containing conventional suppository bases such as 
20 cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be formulated with 
suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable 
oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly 
25 soluble salt 

The pharmaceutical compositions also may comprise suitable sohd or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, 
and polymers such as polyethylene glycols. 
30 Suitable Uquid or soHd pharmaceutical preparation forms are, for exanq)le, aqueous or 

saline solutions for inhalation, microencqjsulated, encochleated, coated onto microscopic 
gold particles, contained in hposomes, nebulized, aerosols, pellets for unplantation into the 
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skm, or dried onto a sharp object to be scratched into the skin. The phaimaceutical 
compositions also include granules, powders, tablets, coated tablets, (niicro)capsules, 
suppositories, syrups, emulsions, suspensions, creams, drops or preparations with protracted 
release of active compounds, in whose preparation excipients and additives and/or auxiliaries 
such as disintegrants, binders, coating agents, swelling agents, lubricants, flavorings, 
sweeteners or solubilizers are customarily used as described above. The pharmaceuticai 
compositions are suitable for use in a variety of drug delivery systems. For a brief review of 
methods for drug delivery, see Langer, Science 249:1527-1533, 1990, which is mcorporated 
herein by reference. 

The compositions may conveniently be presented in unit dosage form and may be 
prepared by any of the methods well known in the art of pharmacy. All methods include the 
step of bringing the active 2-0 sulfatase into association with a carrier which constitutes one 
or more accessory ingredients. In general, the compositions are prepared by uniformly and 
intimately bringing the polymer into association with a liquid carrier, a finely divided solid 
carrier, or both, and then, if necessary, shaping the product. The compositions may be stored 
lyophilized. 

Other dehvery systems can include time-release, delayed release or sustained release 
dehvery systems. Such systems can avoid repeated administrations of the heparinases of the 
invention, increasing convenience to the subject and the physician. Many types of release 
delivery systems are available and known to those of ordinary skill in the art. They include 
polymer based systems such as polylactic andpolyglycolic acid, polyanhydrides and 
polycaprolactone; nonpolymer systems that are lipids including sterols such as cholesterol, 
cholesterol esters and fatty acids or neutral fats such as mono-, di and triglycerides; hydrogel 
release systems; silastic systems; peptide based systems; wax coatings, compressed tablets 
using conventional binders and excipients, partially fused implants and the like. Specific 
examples include, but are not limited to: (a) erosional systems in which the polysaccharide is 
contained in a fonn within a matrix, found in U.S. Patent Nos. 4,452,775 (Kent); 4,667,014 
(Nestor et al); and 4,748,034 and 5,239,660 (Leonard) and (b) diflRxsional systems in which 
an active component permeates at a controlled rate through a polymer, found in U.S. Patent 
Nos. 3,832,253 (Higuchi et al.) and 3,854,480 (Zaffaroni). hi addition, a pump-based 
hardware delivery system can be used, some of which are adapted for implantation. 

A subject is any human or non-human v^tebrate, e.g., dog, cat, horse, cow, pig. 
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When administered to a patient undergoing cancer treatment, the 2-0 sulfatase or 

therapeutic GAG compounds may be administered in cocktails containing other anti-cancer 

agents. The compounds may also be adnainistered in cocktails containing agents that treat the 

side-effects of radiation therapy, such as anti-emetics, radiation protectants, etc. 
5 Anti-cancer drugs that can be co-administered with the compounds of the invention 

include, but are not limited to Acivicin; Aclarubicin; Acodazole Hydrochloride; Acronine; 

Adriamycin; Adozelesin; Aldesleukin; Altretamine; Ambomycin; Ametantrone Acetate; 

Aminoglutethimide; Amsacrine; Anastrozole; Anthramycin; Asparaginase; Asperlin; 

Azacitidine; Azetepa; Azotomycin; Batimastat; Benzodepa; Bicalutamide; Bisantrene 
1 0 Hydrochloride; Bisnafide Dimesylate; Bizelesin; Bleomycin Sulfate; Brequinar Sodium; 

Bropirimine; Busulfan; Cactinomycin; Calusterone; Caracemide; Carbetimer; Carboplatin; 

Carmustine; Carubicin Hydrochloride; Carzelesin; Cedefingol; Chlorambucil; Cirolemycin; 

Cisplatin; Cladribine; Crisnatol Mesylate; Cyclophosphamide; Cytarabine; Dacarbazine; 

Dactinomycin; Daunorubicin Hydrochloride; Decitabine; Dexormaplatin; Dezaguanine; 
15 Dezaguanine Mesylate; Diaziquone; Docetaxel; Doxorubicin; Doxombicin Hydrochloride; 

Droloxifene; Droloxifene Citrate; Dromostanolone Propionate; Duazomycin; Bdatrexate; 

Eflomithine Hydrochloride; Elsamitrucin; Enloplatin; Enpromate; Epipropidine; Epirubicin 

Hydrochloride; Erbulozole; Esorubicin Hydrochloride; Estramustine; Estramustine Phosphate 

Sodium; Etanidazole; Etoposide; Etoposide Phosphate; Etoprine; Fadrozole Hydrochloride; 
20 Fazarabine; Fenretinide; Floxuridine; Fludarabine Phosphate; Fluorouracil; Flurocitabme; 

Fosquidone; Fostriecin Sodium; Gemcitabine; Gemcitabine Hydrochloride; Hydroxyurea; 

Idarubicin Hydrochloride; Ifosfamide; Ihnofosine; Interferon Alfa-2a; Ititerferon Alfa-2b; 

Interferon Alfa-nl ; Interferon Alfa-n3; Interferon Beta- 1 a; Interferon Gamma- 1 b; 

Iproplatin; Irinotecan Hydrochloride; Lanreotide Acetate; Letrozole; LeuproUde Acetate; 
25 Liarozole Hydrochloride; Lometrexol Sodium; Lomustine; Losoxantrone Hydrochloride; 

Masoprocol; Maytansine; Mechlorethamine Hydrochloride; Megestrol Acetate; Melengestrol 

Acetate; Melphalan; Menogaiil; Mercaptopurine; Methotrexate; Methotrexate Sodium; 

Metoprine; Meturedepa; Mitindomide; Mitocarcin; Mitocromin; Mitogillin; Mitomalcin; 

Mitomycin; Mitosper; Mitotane; Mitoxantrone Hydrochloride; MycophenoUc Acid; 
30 Nocodazole; Nogalamycin; Orm^latin; Oxisuran; Pachtaxel; Pegaspargase; Peliomycin; 

Pentamustine; Peplomycin Sulfate; Perfosfamide; Pipobroman; Piposulfan; Piroxantrone 

Hydrochloride; PHcamycin; Plomestane; Porfimer Sodium; Porfiromycin; Prednimustme; 
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Procaxbazine Hydrochloride; Puromycin; Puromycin Hydrochloride; Pyrazofiirin; Riboprine; 
Rogletiinide; Safingol; Safingol Hydrochloride; Semustine; Simtrazene; Sparfosate Sodium; 
Sparsomycin; Spirogennanimn Hydrochloride; Spiromustine; Spiroplatin; Streptonigrin; 
Streptozocin; Sulofenur; Talisomycin; Tecogalan Sodium; Tegafur; Teloxantrone 
5 Hydrochloride; Temoporfin; Teniposide; Teroxirone; Testolactone; Thiandprine; 

Thioguanine; Thiotepa; Tiazofurin; Tirapazamine; Topotecan Hydrochloride; Toremifene 
Citrate; Trestolone Acetate; Tricirihine Phosphate; Trimetrexate; Trimetrexate Glucuronate; 
Triptorelin; Tubxilozole Hydrochloride; Uracil Mustard; Uredepa; Vapreotide; Verteporfin; 
Vinblastine Sulfate; Vincristine Sulfate; Vindesine; Vindesine Sulfate; Vinepidine Sulfate; 
10 Vinglycinate Sulfate; Vmleurosine Sulfate; Vinorelbine Tartrate; Vinrosidine Sulfate; 
Vinzolidine Sulfate; Vorozole; Zeniplatin; Zinostatin; Zorubicin Hydrochloride. 

The 2-0 sulfatase or therq)eutic GAG compounds may also be linked to a targeting 
molecule. A targeting molecule is any molecule or compound which is specific for a 
particular cell or tissue and which can be used to direct the 2-0 sulfatase or therapeutic GAG 
15 to the cell or tissue. Preferably the targeting molecule is a molecule which specifically 

interacts with a cancer cell or a tumor. For instance, the targeting molecule may be a protein 
or other type of molecule that recognizes and specifically interacts with a tumor antigen. 

Tumor-antigens include Melan-A/MART-1, Dipeptidyl peptidase IV (DPPIV), 
adenosine deaminase-binding protein (ADAbp), cyclophilin b, Colorectal associated antigen 
20 (CRC)-C017-1 A/GA733, Carcrnoembryonic Antigen (CEA) and its immunogenic epitopes 
CAP-1 and CAP-2, etv6, amll, Prostate Specific Antigen (PSA) and its immunogenic 
epitopes PSA-1, PSA-2, and PSA-3, prostate-specific membrane antigen (PSMA), T-cell 
receptor/CD3-zeta chain, MAGE-family of tumor antigens (e.g., MAGE-Al, MAGE-A2, 
MAGE-A3, MAGE-A4, MAGE-A5, MAGE-A6, MAGE-A7, MAGE-A8, MAGE-A9, 
25 MAGE-AIO, MAGE-Al 1, MAGE-A12, MAGE-Xp2 (MAGE-B2), MAGE-Xp3 (MAGE- 
B3), MAGB-Xp4 (MAGE-B4), MAGE-Cl, MAGE-C2, MAGE-C3, MAGE-C4, MAGE- 
C5), GAGE-family of tumor antigens (e.g., GAGE-1, GAGE.2, GAGE-3, GAGE-4, GAGE- 
S' GAGE-6, GAGE-7, GAGE-8, GAGE-9), BAGE, RAGE, LAGE-1, NAG, GnT-V, MUM- 
1, CDK4, tyrosinase, p53, MUG family, HER2/neu, p21ras, RCASl, a-fetoprotein, E- 
30 cadherin, a-catenin, p-catenin and y-catenin, pl20ctn, gplOO^"^'^^^^ PRAME, NY-ESO-1, 
brain glycogen phosphorylase, SSX-1, SSX-2 (HOM-MEIMO), SSX-1, SSX-4, SSX-5, SOP- 
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1, CT-7, cdc27, adenomatous polyposis coli protein (APC), fodrin, PI A, Connexin 37, Ig- 
idiotype, pl5, gp75, GM2 and GD2 gangliosides, viral products such as human papilloma 
virus proteins, Smad family of tumor antigens, hnp-1, EBV-encoded nuclear antigen 
(EBNA)-l,andc-erbB-2. 
5 The present invention is further illustrated by the following Examples, which in no way 

should be construed as further limiting. The entire contents of all of the references (including 
Uterature references, issued patents, pubUshed patent apphcations, and co-pending patent 
applications) cited throughout this apphcation are hereby expressly incorporated by reference. 

10 EXAMPLES 

Materials And Methods 

Reagents — Heparin and chondroitin disaccaharides were purchased from Calbiochem (La 
Jolla, CA). Unfractionated heparin was obtained from Celsus Laboratories (Cincinatti, OH). 

15 The unsaturated heparin tetrasaccharide AU2sHns,6sI2sHns.6s (Tl) decasaccharide 
AU2sHns,6sI2sHns,6sI2sHns.6sIHnac,6sGHns,3S,6S (AT-10) were generated by a partial 
heparinase digestion and purified as described (Toida, T., Hileman, R. E., Smith, A. E., 
Vlahova, P. L, and Linhardt, R. J. (1996) J Biol Chem 271(50), 32040-7). Materials for 
XZAP n genomic Hbrary construction, screening and phagemid excision including 

20 bacteriophage host strain XLlBlue MRF and the helper-resistant strain SOLR were obtained 
from Stratagene (La JoUa, CA) and used according to the Manufacturer's instructions. 
Restriction endonucleases and molecular cloning and PGR enzymes were purchased from 
New England Biolabs (Beverly, MA), DNA oUgonucleotide primers were synthesized by 
hivitrogen/Life Technologies custom primer service (Carhsbad, CA). TOP 10 chemically 

25 competent cells for PGR cloning and subcloning were also obtained from hivitrogen. 

[^^P]dCTP radionuclides were purchased from NEN (Boston, MA). Additional molecular 
cloning reagents were obtained from the manufacturers hsted. Modified trypsin (sequencing 
grade) was purchased from Roche Molecular Biochemicals (Indianq)ohs, IN)- Texas Red 
hydrazine was purchased from Molecular Probes (Eugene, OR). All other reagents were 

30 from Sigma-Aldrich (St Louis, MO) unless otherwise noted. 
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Purification of the Flavobacterium heparinum 2-0 sulfatase and subsequent proteolysis — 
The 2-0 sulfatase was purified fi-om 20 liter fermentation ctdtures. Briefly, the large-scale 
cultures were grown at 25°C for 48 hours. Cell lysates were obtained by a repeated passage 
of a resuspended cell pellet through an Aminco French-pressure cell. The homogenate was 

5 clarified by centrifiigation (37000 X g). The 2-0 sulfatase was purified fixnn this cell-free 
supernatant by employing five chromatographic steps carried out in the following sequence: 
cation-exchange (CM-Sepaharose CL-6B)-> hydroxyapatite (Bio-Gel HTP) -> gel filtration 
(Sephadex G-50) taurLne-Sepharose CL-4B blue-Sepharose CL-6B. 2-0 sulfatase 
activity was measured at each chromatography step as described (McLean, M. W., Bruce, J. 

10 S., Long, W. F., and Williamson, F. B. (19SA) Eur J Biochem 145(3), 607-15). Fractions 
from 6 initial CM-sepharose chromatography were also assayed for heparinase, 
chondroitinase (AC and B) and A 4,5 glycuronidase activities as well as any co-eluting 6-0 or 
N sulfatase activities. The highly purified 2-0 sulfatase pool from the final blue-Sepharose 
chromatography step was free fix>m any contaminating glycosaminoglycan degrading 

15 activity. 

Generation of 2-0 sulfatase peptides and protein sequencing — hi preparation for proteolysis, 
the purified flavobacterial sulfatase was first desalted by reverse phase chromatography (RP- 
HPLC) on a 150 mm X 4.6 mm C4 column (Phenomenex, Torrance, CA). Protein was eluted 

20 by ^plying a linear gradient from 0-80% acetoiutrile in 0. 1% TFA. During this elution, both 
a major and minor protein peak was detected by UV absorbance at 21 0 mn and 277 nm 
(Fig. 1 Panel (A)). The two separate fractions were lyophilized to dryness and resuspended 
in 50 jiL of denaturation buffer (8M Urea, 0.4 M ammonium bicarbonate, pH 7.5). Both 
protein fractions were digested with modified trypsin for approximately 18 hoxirs at 37*^0. 

25 Trypsin was added at a 1 :40 ratio (w/w) relative to each sulfatase fraction. Prior to 

proteolysis, cysteines were first subjected to reductive carboxymethylation by the addition of 
5 mM dithiothreitol for 1 hour at 50*^C, followed by the addition of 20 mM iodoacetic acid 
for 30 minutes (room temperature). The alkylation reaction was quenched by the addition of 
50 [iL denaturation buffer. The resulting peptides were resolved by RP-HPLC on a 250 nam 

30 X 2 mm C4 column using a linear gradient of 2-80% acetonitrile ui 0. 1 % trifluoracetic acid 
carried out over a 120 minute timecourse. Select peptides corresponding to chromatography 
peaks 2, 3, 4, 5, and 8 (Fig. 1 Panel (B)) were sequenced using an on-line Model 120 
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phenylthiohydantoin-derivative analyzer (Biopolymers Laboratory, Massachusetts lostitute of 
Technology). 

Molecular cloning oftheflavobacterial 2-0 sulfatase — ^The 2-0 sulfatase was cloned from a 

5 XZAP n flavobacterial genomic Ubrary constructed and screened essentially as described for 
the A 4,5 glycuronidase (Myette, J. R., Shriver, Z., Kiziltepe, T., McLean, M. W., 
Venkataraman, G., and Sasisekharan, R. (2002) Biochemistry 41(23), lAlA-l^ZA). A 600 
base pair DNA plaque hybridization probe was generated by PGR using degenerate primers 
5' ATHGAYATHATHCCNACNATH 3' (forward, SEQ ID NO: 8) and 5' 

10 DATNGTYTCATTNCCRTGYTG 3' (reverse, SEQ ID NO: 9). PGR was carried out for 35 
cycles using a 52°C annealing temperature and 2 minute extensions at 72^C. The specificity 
of this probe was established by DNA sequence analysis, which indicated a direct 
correspondence of its translated sequence to peak 1 tryptic peptides. Based on this 
information, the non-degenerate primers 5' CATACACGTATGGGCGATTAT 3' (forward, 

15 SEQ ID NO: 10) and 5' GATGTGGGGATGATGTCGAT y (reverse, SEQ ID NO: 11) were 
subsequently used in place of the original degenerate primers. PGR ampUfied DNA probe 
was gel purified and subsequently ^^P radiolabeled using the Prime-it n random priming kit 
(Stratagene). Plaques were Ufted on to nylon membranes (Nytran Supercharge, Schleicher 
and Schuell, Keene, NH) and DNA was crossUnked to each filter by UV-iiradiation. Plaque 

20 hybridizations were completed overnight at 42°C according to standard methods and 
solutions {Current Protocols in Molecular Biology (1987) (Ausubel, F. M., Brent, R., 
Kingston, RE., Moore, D.D., Seidman, J.G., Smith, J.A., and Struhl, K., Ed.) 1-3 vols., John 
Wiley and Sons, New York). Positive clones were visualized by phosphor imaging 
(Molecular Dynamics, Piscataway, NJ) and/or ^^P autoradiography. Glones were fiirther . 

25 purified by secondary and tertiary screens and the recombinant phage was excised as a 
double-stranded phagemid (pBluescript) as described by the manufacturer (Stratagene). 
Recombinants were confirmed by DNA sequencing using both T7 and T3 primers. Insert 
size was determined by restriction mapping of pBluescript inserts using Not 1, Xba 1 , and 
Xhol. 

30 The fiiU-length sulfatase gene (phagemid clone S4A) was subcloned into the T7-based 

expression plasmid pET28a in three steps. In the first PGR step, Nde 1 and Xho Irestriction 
sites were introduced at the 5' and 3' termini of the 2-0 sulfatase coding sequence by using 
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primers 5' TGTTCTAGACATATGAAGATGTACAAATCGAAAGG 3 ' (SEQ ID NO: 12) 
and 5' GTCTCGAGGAT CCITATTTrTTTAATGCATAAAACGA^^ 3' (SEQ ID NO: 
13), respectively. At the same time, the Nde 1 restriction site already present within the 
sulfatase gene starting at position 1049 (Fig. 2) was abolished by silent mutagenesis 
5 (CATATG ^ CATCTG) using the mutagenic primers 5' 

GATATTATCCCCACCATCTGTGGCTTTGCCGGAA 3' (SEQ ID NO: 14) and 5' 
TTCCGGCAAAGCCACAGATGGrGGGGATAATATC 3' (SEQ W NO: 15), with the A 
to C transversion noted in bold. In the second step, the final PGR product was gel purified 
and hgated into the TOPO/TA PGR cloning vector pCR 2. 1 (Invitrogen) following the 

10 addition of 3' dA overhangs with 0.5 units of Taq polymerase and 300 |iM dATP (10 

minutes, 72''C). Ligated DNA was transformed into One-shot TOP 10 chemically competent 
cells. Positive clones were identified by blue/white colony selection and confirmed by PCR 
colony screening. In the third step, the 1 .5 kb sulfatase gene was excised from pCR 2. 1 
TOPO and pasted into pET28a (Novagen, Madison, WT) as an Nde 1-Xho 1 cassette. Final 

15 expression clones were confirmed by plasmid DNA sequencing. 

A 2-0 sulfatase amino terminal truncation lacking the first 24 amino acids (2-0 AN^' 
was PGR cloned as above except the forward primer 5 * 
TCTAGACATATGCAAACCTCAAAA GTAGCAGCT 3' (SEQ ID NO: 16) was used in 
place of original outside 5' primer listed, hi this DNA constmct, the 2-0 sulfatase-specific 

20 sequence begins with Q25 (Fig. 2) and reads MQTSKVAASRPN (SEQ ID NO: 1 7). 

Recombinant Expression and protein purification of a 6X histidine-tagged 2-0 sulfatase (and 
2-0 AA^'^^;— Both the fiiU-length enzyme and the truncated enzyme (2-0 AN^'^^) were 
recombinantly expressed in the E. coli strain BL2 1 (DE3) (Novagen) initially as NH2- 

25 tenrdnal 6X histidine fusion proteins to facihtate purification. The protocol for their 

expression and subsequent one-step purification by nickel chelation chromatography was as 
previously described for the A 4,5 glycuronidase (Myette, J. R., Shriver, Z., Kiziltepe, T., 
McLean, M. W., Venkataraman, G., and Sasisekbaran, R. (2002) Biochemistry 41(23), 7424- 
7434). Greater than 90% of the enzyme was eluted from a 5 ml column in a single 12.5 ml 

30 fraction following the addition of high imidazole elution buffer (50 mM Tris-HCL, pH 7.9, 
0.5 M NaCl, and 250 mM imidazole). The enzyme was immediately diluted with 2 volumes 
of cold enzyme dilution buffer (50 niM Tris, pH 7.5, 100 mM NaCl). Cleavage of the 6X 
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bistidine tag by thrombin was achieved by the step-wise addition of 10 units of biotinylated 
thrombin (total 50 miits) to 30 mL of diluted enzyme over the course of several hours while 
gently mixing by inversion at 4°C. Substantial precipitation of the sulfatase routinely 
occurred during the cleavage reaction. Thrombin was recovered by the addition of 
streptavidin agarose using the thrombin cleavage capture kit (Novagen). Capture was carried 
out a 4**C for 2 hours with gentle mixing. Bound thrombin was collected by centrifiigation 
for 5 minutes at 500 X g. Supernatant containing soluble 2-0 sulfatase was then dialyzed at 
4°C against 12 liters of enzyme dilution buffer using 20.4 mm diameter Spectra/Por dialysis 
tubing (Spectrum Laboratories, Rancho Dominguez, CA) with a 10,000 MWCO. Following 
dialysis, the purified sulfatase was concentrated using a Centriplus YMIO ultrafiltration 
device (Millipore, Watertown, MA). The enzyme was stable for at least two weeks at 4°C. 
Long-term storage was carried out at -85°C in the presence of 10% glycerol without any 
subsequent loss of activity due to fi-eezing and thawing. 

Protein concentrations were determined by the Bio-Rad protein assay and confirmed 
by UV spectroscopy using a theoretical molar extinction coefficient (6280) of 77,380 M"^ for 
2-0 AN^"^"* with the histidine tag removed. Protein purity was assessed by silver-staining of 
SDS-polyacrylamide gels. 

Computational methods — Sulfatase multiple sequence ahgnments were made fiom select 
BLAST? database sequences (with scores exceeding 100 bits and less than 6% gaps) using 
the CLUSTALW program (version 1 .81) preset to an open gap penalty of 10.0, a gap 
extension penalty of 0.20, and both hydrophilic and residue-specific gap penalties turned oa 
Signal sequence predictions were made by SignalP VI .1 using the von Heijne computational 
method (Nielsen, H., Engelbrecht, J., Brunak, S., and von Heijne, G. (1997) Protein Eng 
10(1), 1-6). 

Molecular mass determinations by MALDI-MS—Tho molecular weight of the 2-0 sulfatase 
NH2 truncated enzyme (2-0 AN^"^"^) was determined by matrix-assisted laser desorption 
ionization mass spectrometry (MALDI-MS) essentially as described (Rhomberg, A. J., Ernst, 
S., Sasisekharan, R., and Biemann, K. (1998) Proa Natl Acad Sci USA 95(8), 4176-81). The 
NH2-terminal histidine tag of the recombinant protein was cleaved by thrombin prior to mass 
analysis. 1 ^L of a 2-0 sulfetase solution (diluted in water to 0.5 mg/mL) was added to 1 fil 
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of a saturated sinapinic acid matrix solutioii previously deposited onto the plate. The - 
observed mass of the recombinant enzyme was corrected according to an external calibration 
using mass standards. 

5 2-0 sulfatase assay and determination of biochemical reaction conditions — 2-0 sulfatase 
activity was measiured using the unsaturated heparin trisulfated disaccharide AU2sHns.6s or 
the disulfated disaccharide AUzsHns as well as the disulfated disaccharide AUHns,6s lacking a 
sulfate at the 2-OH position. Standard reactions included 50 mM imidazole, pH 6.5, 50 mM 
NaCl, 500 |iM disaccharide, and 25 nM of enzyme (2-0 AN^'^"^) in a 20 jiL reaction volume. 

10 The reaction was earned out for 30 seconds at 30°C. Prior to its addition, the enzyme was 
serially diluted to 250 uM in ice cold IX imidazole buffer. The assay was initiated by the 
addition of 2|aL of this lOX enzyme stock to 18 jiL of reaction mixture. The enzyme was 
inactivated by heating at 95''C 1 1 for five minutes in pre-heated 0.5 mL eppendorf tubes. 
Desulfation at the 2-OH position of the disaccharide was measured by capillary 

15 electrophoresis. Resolution of substrate and product were achieved under standard conditions 
described for HSGAG compositional analyses (Rhomberg, A, J., Ernst, S., Sasisekharan, R., 
andBiemann, K. (199%) Proc Natl Acad Sci USA 95(8), 4176-81). Activity was generally 
measured as moles of desulfated product formed and was calculated from the measured area 
of the product peak based on molar conversion factors empirically determined firom standard 

20 curves. For the detection of mono- and di-sulfated disaccharide products, total 

electrophoresis time was 20 minutes. Each unsaturated disaccharide peak was detected by 
UV absorption at 232 nm. 

For pilot experiments measuring the relative effect of ionic strength on 2-0 sulfatase 
activity, the NaCl concentration was varied firom 0.05 to 1 M in 50 mM MES buffer (pH 6.5) 

25 that included 500 p-M of the disulfated disacchiide AU2sHns,6S and 50 nM enzyme. The 
effect of pH on sulfatase activity was assessed as a function of catalytic efficiency by 
measuring kinetic parameters in the following two overlappkig pH buffer systems ranging 
from 5.0 to 8.0: 50 mM MES at pH 5.0, 5.5, 6.5, and 7.0; 50 mM MOPS at pH 6.5, 7.0, 7.5 
and 8.0. Assays included 25 nM enzyme, 50 mM NaCl and varying concentrations of the 

30 disuffated disaccharide substrate AU2sHnS' Km and kcat values were extrapolated firom Vo vs. 
[S] curves fit to the MichaeUs- Menten equation by a non-linear least squares regression and 
the relative kcat/Km ratios plotted as a fimction of buffer pH. Based on this profile, relative 
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enzyme activity was also measured in four different buffers (MES, imidazole, ADA, and 
sodium phosphate) each present as a 50 naM concentration at pH 6.5. Relative activities were 
measured at a single saturating substrate concentration (4mM) usiag AU2sHns- 

Tandem use of2-0 sulfatase and A 4,5 glycuronidase in HSGAG compositional analyses — 
200 |ig of heparin was first digested with all three heparinases in an overnight digestion in 
glycuronidase reaction buffer which included 50 mM PIPES, pH 6.5, 50 mM NaCl and a 100 
\iL reaction volume. The heparinase digestion mix was split into 4 X 20 ^iL reactions which 
were individually treated as follows: Tube 1, no addition (heparinase only control); Tube 2, 5 
^g of A 4,5 glycuronidase, 30°C 1 hour; Tube 3, 5ng 2-0 sulfatase (2-0 AN^"^"^) 37°C, 1 hour; 
Tube 4, 2-0 sulfatase and A 4,5 glycuronidase added simultaneously, 30°C, 1 hour. A 4,5 
glycuronidase activity was ascertained by a disappearance of unsaturated disaccharide peaks 
due to the loss of UV absorption at 232 nm. 

The substrate-product relationship between the two enzymes was examined by 
directly measuring A 4,5 glycuronidase activity either before or following the addition of 
recombinant 2-0 sulfatase. Reactions were carried out at SO^'C and included 50 mM MES, 
pH 6.5, 100 mM NaCl, and 2 mM AU2sHns in a 100 fiL reaction volume. In these 
experiments, 250 nM A 4,5 glycuronidase and 25 nM 2-0 AN^'^"^ were sequentially added as 
follows: A 4,5 alone, A 4,5 followed by 2-0 sulfatase, or 2-0 sulfatase followed by A 4,5. In 
each case, the first enzyme was added to the reaction in a 2 minute preincubation step. A 4,5 
glycuronidase activity was measured immediately following the addition of the second 
enzyme by determining the rate of substrate disappearance as monitored by the loss of UV 
absorption at 232 nm (Myette, J. R., Shriver, Z., Kiziltepe, T., McLean, M. W., 
Venkataraman, G., and Sasisekharan, R. (2002) Biochemistry 41 (23), 7424-7434). A 4,5 
activity for the corresponding 2-0 desulfated disaccharide AUHns was also measured under 
identical conditions. 

Homology modeling of 2-0 sulfatase— Tho crystal structure of human arylsulfatase A, human 
arylsulfatase B, and the P. aeruginosa arylsul&tase (von Bulow, R., Schmidt, B., Dierks, T., 
von Figura, and Uson, L (2001) JMolBiol 305(2), 269-77) were used to obtain a 
structural model for the 2-0 sulfatase enzyme. A multiple sequence alignment was 
performed using CLUSTALW algorithm (Higgins, D. G., Thompson, J. D., and Gibson, T. J. 
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(1996) Methods Enzymol 266, 383-402) on the 2-0 sulfatase and the sulfatase sequences 
whose crystal stnictures have been solved (himaa arylsulfatase A, B and i>. aeruginosa 
arylsulfalase) (Figs. 9 and 16). Based on this multiple sequence aUgmnent, three model 
structures of 2-0 sulfatase were obtained corresponding to its alignment with the other three 
5 sulfatases. The models were constructed using the Homology module of Insight n molecular 
simulations package (Accebys, San Diego, CA). The side chain of the critical Cys 82 which 
is shown to mdergo posttranslational modification in the active enzyme was replaced by the 
geminal diol [Cjg(OH)2]. The potentials for the model stmctures were assigned using the 
AMBER force field (Homans, S. W. (1990) Biochemistry 29(39), 9110-8). The deletions in 
10 the modeled stmctuie were closed using 200 steps of steepest descent minimization without 
including charges by keeping most of the structure rigid and allowing the regions close to the 
deletion move freely. The final refined structure was subjected to 400 steps of steepest 
descent imnimization without including charges and 400 steps of conjugate gradient 
rnininiization including charges. 

15 

Molecular docking of disaccharide substrates into the active site of the modeled 2-0 
sulfatase — ^Heparin derived disaccharides with a AU at the non-reducing end were modeled 
as follows. The coordinates of thetrisulfated AU containing disaccharide (AU2sHks,6s) were 
obtained from the co-crystal structure of a heparinase derived hexasaccharide with fibroblast 

20 growth factor 2 (PDB id: IBFC). This trisulfated disaccharide structure was used as a 
reference to generate the structural models for other disaccharides including AU2sHns, 
AU2sH>rAc aiid AU2sHnac,6s. The coordinates of trisulfated disaccharides (I2sHns,6s) 
containing iduronic acids in the ^Ca and ^So conformations were also obtained from IBFC 
(PDB id: IBFC). Similarly chondioitin sulfate derived disaccharides AU2sGalNAc,4s and 

25 AU2sGalNAc,6S were modeled using a reference structure of a chondroitin-4 sulfate 

disaccharide AUGalKAc,4S whose coordinates were obtained from its co-crystal structure with 
the chondroitinase B enzyme (PDB id: IDBO). The potentials for these disaccharides were 
assigned using the AMBER force field modified to include carbohydrates (Homans, S. W. 
(1990) Biochemistry 29(39), 9110-8) with sulfate and sulfamate groups (Huige, C. J. M., 

30 Altona, C. (1995) 1 Comput Chem. 16, 56-79). 

The orientation of the cleavable sulfate group relative to O7I of the geminal diol in 
the active site of human arylsulfatase A and the bacterial arylsulfatase was identical as 
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observed in their respective crystal structures. This orientation was such that one of the faces 
of the tetrahedral formed by the 3 oxygen atoms of SO3" was oriented towards Oyl 
facilitating the nucleophilic attack of the sulfur atom and the transfer of the SO3" group to 
O7I (Waldow, A., Schmidt, B., Dierks, T., von Bulow, R., and von Figura, K. (1999) J Biol 

5 Chem 274(18), 12284-8). This highly specific orientation of the sulfate group helped in 
positioning the disaccharide substrates relative to the active site of the 2-0 sulfatase. After 
fixing the orientation of the 2-0 sulfate group, the glycosidic torsion angles and exocycUc 
torsion angles were adjusted manually to remove unfavorable steric contacts with the amino 
acids in the active site. The enzyme substrate complexes were minimized using 200 steps of 

10 steepest descent followed by 400 steps of Newton-Raphson minimization including charges. 
Most of the enzyme was kept rigid and only the loop regions constituting the active site were 
allowed to move freely. To model the disaccharide structure, a forcing constant of 7000 
kcal/mole was appUed to the ring torsion angles during the energy minimization calculations 
while simultaneously fixing the ring conformation of the individual monosaccharide units. 

15 The manual positiomng of the substrates was done using the Viewer module, building of the 
disaccharide structures from the reference structures was done using the Builder module and 
the energy minimization was done using the Discover module of Insight U. 

Heparin compositional analyses by capillary electrophoresis andMALDI-MS — 
20 Approximately 10 p.g of the AT- 10 oUgosaccharide were incubated with 100 picomoles of 2- 
O ANf^"^"^ in a 40 nL reaction volume at 30°C. 15 aHquots were removed at 4 hours and 17 
hours and heat inactivated at 95°C. The oUgosaccharide reaction products (along with 15 ^iL 
of a minus sulfatase control) were subjected to an exhaustive heparinase I and IE digestion 
prior to CE-based compositional analysis. Desulfation of the decasaccharide was assayed in 
25 parallel by MALDI-MS using established methods (Rhomberg, A. J., Ernst, S., Sasisekharan, 
R., andBiemann, K. (1998) Proc Natl Acad Sd USA 95(8), 4176-81.). 

Substrate specificity and kinetics experiments using different disaccharide substrates — ^For 
substrate specificity experiments, the following heparin disaccharide substrates were used: 
30 AUisHnac, AU2sHnac.6s, AU2sHns, and AU2sHNs,ds. In addition, the chondroitin disaccharides 
AU2sGalNAc,4s and AU2sGalNAc.6S were also studied. Disaccharide concentrations for each 
respective substrate were varied Scorn 0.1 mM to 4 mM. Initial rates (Vo) were extrapolated 
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jfrom linear activities representing <20% substrate turnover and JBt to pseudo first-order 
kinetics. Standard reactions included 50 mM imidazole, pH 6.5, 50 mM NaCl, 500 ^lM 
disaccharide, and 25 nM of enzyme (2-0 AN^'^"^) in a 20 tiL reaction volinne. The reaction 
was carried out for 30 seconds at 30^C. Prior to its addition, the enzyme was serially diluted 

5 to 250 nM in ice cold IX imidazole buffer. The assay was initiated by the addition of 2jiL of 
this lOX enzyme stock to 18 fiL of reaction mixture. Sulfatase activity was inactivated far 
five minutes at 95°C in pre-heated 0.5 mL eppendorf tubes. Desulfation at the 2-OH position 
of the disaccharide was measured by capillary electrophoresis. Resolution of substrate and 
product were achieved under standard conditions described for HSGAG compositional 

10 analyses (Venkataraman, G., Shriver, Z., Raman, K, and Sasisekharan, R. (1999) Science 
286(5439), 537-42). Activity was measured as moles of desulfated product formed and was 
calculated from the measured area of the product peak based on molar conversion factors 
empirically determined from standard curves. For the detection of mono- and di-sulfated^ 
disaccharide products, total electrophoresis time was 25 minutes. Each unsaturated 

15 disaccharide peak was detected by UV absorption at 232 nm. All the substrate saturation 
kinetics were measured under Michaelis-Menten conditions. 

2-0 sulfatase active site labeling and peptide mapping — Approximately 500 of 6X 
histidine-tagged 2-0 AN^'^* (wild-type enzyme and C82A site-directed mutant) were 

20 lyophilized by Speed-Vac centrifugation and vigorously resuspended in 90 jiL denaturation 
buffer containing 6M guandinium hydrochloride, 0.1 M Tris-HCL, pH 7.5. Active site 
aldehydes were fluorescently labeled by adding 25 of Texas Red hydrazine made up as a 
10 mM stock in dimethyl formamide (DMF). Labeling was carried out for three hours at 
room temperature with gentle mixing on a rotating platform. The hydrazone Unkage was 

25 stabilized by the addition of 10 [iL of a fresh 5M sodium cyanoborohydride stock made up in 
IN NaOH. Reduction was carried out for 1 hour at room temperature. Umeacted 
fluorophore was removed by repeated acetone precipitation (added 5:1 v:v). Acetone was 
prechilled at -20''C. Samples were chilled at -85*'C for 20 minutes prior to spiiming in a 
microfuge for 10 minutes, maximum speed, at 4^C. Pellets were briefly dried by Speed-Vac 

30 centrifugation. 

The labeled sulfatase (and unlabeled control) were proteolyzed with sequence grade- 
modified trypsin for 20 hours at 37°C in digestion buffer that contained 0. 1 M Tris-HCL, pH 
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8.5, 1 mM EDTA, 1 mM DTT and 10% acetonitrile (v/v) in a 30 nL reaction volume. 
Trypsin was first reconstituted as a 2.5 mg/mL stock in 1% acetic acid and added at a 1:5 
ratio (w/w) relative to the target protein. Following trypsin digestion, peptide cj^eines were 
reduced by the addition of 50 mM DTT (50°C under argon, 1 hour). Reduced cysteines were 
5 subsequently alkylated for 30 minutes at 37°C (in the dark) by the addition of 150 mM 

iodoacetainide, added firom a 2M stock made up in O.IM Tris-HCL, pH 8.5. This reduction- 
alkylation cycle was repeated one more time. 

Molecular masses of select peptides were determined by MALDI-MS as described 
(Myette, J.R., Shriver, Z., Liu, J., Venkataraman, G., Rosenberg, R., and Sasisekharan, R. 
10 (2002) Biochem Biophys Res Contmun 290(4), 1206-13) using 1 ^L of a-cyano-4- 
hydroxycinnamic acid (CHCA) in 50% acetonitrile, 0.3% TFA as a matrix. 

Site-directed mutagenesis of the C82A active site mutant—The site-directed mutant C82A 
was cloned by recombinant PGR using outside primers 5' TCT AGA CAT ATG CAA ACC 

15 TCA AAA GTA GCA GCT 3* (forward, SEQ ID NO: 18) and (5' GT CTC GAG GAT CCT 
TAT TTT TTT AAT GCA TAA AAC GAA TCC 3' (reverse, SEQ ID NO: 19) in addition to 
the following mutagenic primer pair: 5' C GAG CCG CTC GCT ACA CCT TCA CG 3 ' 
(forward, SEQ ID NO: 20) and 5' CG TGA AGG TGT AGC GAG CGG CTG G V (reverse, 
SEQ ID NO: 21). The engineered codon change for each DNA strand is underlined. 

20 Subcloning into pET28a, recombinant expression in the £. coli strain BL21 (DE3), and 
subsequent purification by nickel chelation chromatogr^hy using the N-teraiinal 6X 
histidine purification tag are as described above for 2-0 AN^"^"^. 

Circular ^icAroiym— Recombinant^ expressed 2-0 sulfatase and the inactive C82A mutant 
25 were concentrated and buffer-exchanged into 50 mM sodium phosphate, pH 7.0, using a 
Centricon 10 ultrafiltration device (MiUipore). CD spectra were collected on an Aviv 62DS 
spectropolarimeter eqixipped with a thermostatic temperature control and interfaced to an 
IBM microcomputer. Measurements were performed in a quartz cell with a 1 mm path 
length. Spectra were recorded at 25°C in an average of 5 scans between 205 and 270 nm 
30 with a 1 .0 nm bandwidth and a scan rate of 12 nm/min. CD band intensities are expressed as 
molar ellipticities, 5M, in degrees-cm^dmol ^ 
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Results 

Molecular cloning and recombinant expression of the F. hepannum 2-0 sulfatase— As a first 
step towards the cloning the 2-0 sulfatase gene, we purified the enzyme directly jfrom the 
5 native bacterium followed by a partial determination of its amino acid sequence. After a five- 
step chromatographic fractionation of flavobacterial lysates, we achieved a greater than 3000- 
fold purification of sulfatase activity. Further firactionation of this activity by reverse phase 
HPLC chromatography yielded two separate polypeptides (Fig. 1, Panel (A)). Both proteins 
were subjected to a limit trypsin digestion and the resultant peptides likewise purified by 

10 reverse phase HPLC (Fig. 1, Panel (B)). From select peak 1 peptide sequences, degenerate 
primers were synthesized. We initially screened primer pairs corresponding exclusively to 
peak 1 protein sequence (Table 1), given the fact that this sulfatase fi-action represented the 
major protein species present in the final purification step. PGR amplification of genomic 
DNA using degenerate primers conresponding to peptide peaks 3 and 5 yielded a discrete 600 

15 bp DNA product. Sequence analysis of this amplified DNA indicated a translated amino acid 
sequence to which three of the isolated peak 1 peptides mapped. We used this DNA, 
therefore, as a hybridization probe to screen a XZAP flavobacterial genomic hbrary and 
isolate a full-length clone. Several positive clones were isolated; most of them contained an 
average insert size between 4-5 kb. One genomic clone of approximately 7 kb (clone S4A) 

20 was subjected to direct DNA sequencing. This clone contamed at least one open reading 
firame (ORF) in particular that encodes a putative protein of 468 amino acids in length (464 
amino acids firom first methionine) and whose primary sequence includes all of the sulfatase 
peptides for which we had obtained sequence information (Fig. 2). Based on its amino acid 
composition, the encoded protein is quite basic (theoretical pi of 8.75), with 67 basic side 

25 chains comprising 14 approximately 14% on a molar basis. The putative sulfatase also 
possesses 8 cystemes m addition to 46 aromatic amino acids. 
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Table 1: 2-O-sulfatase peptides and cx)rrespondmg degenerate primers 



Peak 
No. 


Peptide Sequence 


Degenerate Primers 


2 


YIVYDKGEIR (SEQ ID NO:22) 


5* TAYATHGTNTAYGAYAARGG 3* fSEO ID NO:27) 
5' NCCYTTRTCRTANACDATRTA V (SEQ ID NO:28) 


3 


TYPSVGWNESQWR (SEQ- ID. NO:23) 


5' CARCAYGGNTTYGARACNAT 3' fSEO ID NO:29) 
5' DATNGTYTCATTNCCRTGYTG 3* (SEQ ID NO:30) 


4 


KMPHETGFTGNTPEKDGQWPDSVLMMGK 
(SEOIDNO:24) 


5' TAYATHGTOTAYGAYAARGG 3' (SEC ID N0:3n 
5' NCCYTTRTANACDATRTA 3' (SEQ ID NO:32) 


5 


VAQHGFETIENTGMGDYTDAVTPSQCANFNK 
rSEOIDNO:25) 


5' ATHGAYATHATHCCNACNAT 3' fSEQ ID NO:33) 
5' DATNGTNGGDATDATRTCDAT3' (SEQIDNO:34) 


8 


TDDQLVCNGIDIIPnCGFAGIAK 
(SEQIDNO:26) 


5' GAYATHATHCCNACNATHTGYTT 3' fSEO IDNO:35) 
5* AARCADATNGTNGGDATDATRTC 3' (SEQ ID NO:36) 



Select RP-HPLC purified tryptic peptides (see also Fig. 1, Panel (B)) were subjected to amino acid sequencing. 
5 Also shown are the corresponding degenerate primers. 

Upon a closer examination of its primary sequence, we also identij&ed a conserved 
sulfatase domain. This signature domain included the consensus sequence 
C/SXPXRXXXXS/TG (SEQ ID NO: 6) presumably comprising (at least in part) the sulfatase 

10 active site and possessing the cysteine (denoted in bold) that is most likely modijSed as a 

formylglycine.in vivo. The putative 2-0 sulfatase that we cloned from F. heparinwn exhibits 
substantial homology to many members of a highly conserved sulfatase family (Fig. 3) 
(Bond, C. S., Clements, P. R., Ashby, S. J., Collyer, C. A., Harrop, S. J., Hopwood, J. J., and 
Guss, J. M. (1997) Structure 5(2), 277-89, Parenti, G., Meroni, G., and Ballabio, A. (1997) 

15 Curr Opin Genet Dev 7(3), 386-91). A structurally-oriented description of this homology 
and its correlation to enzyme function is found below. 

From this sequence information, we were confident that we had indeed cloned a 
sulfatase from the flavobacterial genome. To ultimately establish its functionality, we next 
set out to recombinantly express this protein in E. colii The ftdl-length gene (be ginnin g at the 

20 first methionine noted in Fig. 2) was subcloned into the T7-based expression vector, pET28a 
for expression as an NH2-terminal 6X histidine-tagged protein to facilitate purification. 
Induction with IPTG led to a limited soluble expression of a polypeptide whose apparent 
molecular weight roughly corresponded to the theoretical mass of the fiision protein 
(approximately 54 kDa). Using Ni^^ chelation chromatography, we were able to partially 

25 purify this polypeptide from the bacterial lysate and unequivocally measure 2-0 specific 
sulfetase activity using the trisulfated, unsaturated heparin disaccharide AUzsHns^ ^ ^ 
substrate. 
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We identified a putative signal sequence for the flavobacterial 2-0 sulfatase 
comprised of the fixst 24 amino acids (see Fig. 2). By engineering a 2-0 sulfatase N-tenninal 
truncation lacking this sequence (herein referred to as 2-0 AN^'^"*), we achieved high 
expression levels of soluble, highly active enzyme. Protein yields exceeding 1 00 mg of 

5 relatively pure sulfatase per liter of induced bacterial cultures were routinely achieved using a 
single chromatographic step (Fig. 4). The specific activity of the recombinant sulfatase was 
considerably enhanced following the removal of the N-terminal 6X-histidine tag by thrombin 
cleavage. Removal of this purification tag resulted in a greater than 10-fold purification of 
sulfatase activity relative to the crude bacterial lysate (Table 2). For this reason, we used the 

10 cleaved protein in all subsequent experiments. The molecular weight of this recombinantly 
expressed sulfatase as determined by MALDI-MS is 50,120.8 Daltons. This empirical value 
closely agrees with its theoretical mass of 49,796 Daltons that is based entirely on its amino 
acid composition. 

15 Table 2: Purification of recombinant 2-O-sulfatase 



Fraction 


Total Protein 


Specific Activity 

fnanomoles of EMSMn/u^ jjroteinl 


Fold-poriificatiou 


lysate 


322 


414 






122 


7,43 


K8 


His Tflfi lemoval 


15* 


413 


10.7 



200 ng of total protein from each pmification step was assayed for 2-O-sulfatase activity as described in 
Materials and Methods using the unsaturated heparin disaccharide (DiS) U2sHms as a substrate. 
Fold purification is relative to crude bacterial lysate. 
20 ^Soluble enzyme remaining after substantial loss due to protein precipitation. 

To establish the recombinant enzyme's exclusivity for the uronic acid 2-0 sulfate, we 
initially compared two related unsaturated heparin disachharides: AU2sHns.6S versus 
AUHns,6s- The recombinant sulfatase only hydrolyzed a single sulfate, namely, the one found 
25 at the 2-OH position (Fig. 5). 

Biochemical cojiditiom for optimal in vitro acrivzfy— Having successfiiUy achieved the 
recombinant expression and purification of the flavobacterial sulfatase as a soluble enzyme as 
well as demonstration of its unequivocal specificity for the uronic acid 2-0 sulfate, we next 
30 set out to define the reaction conditions required for optimal enzyme activity in vitro. These 
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parameters included pH, temperature, ionic strength, and possible divalent metal ion 
dependency. In brief, the enzyme exhibited a pH activity range between 6.0 and 7.0, with 
• optimum activity occurring at pH 6.5 (Fig. 6, Panel (A)). The enzyme was essentially 
inactive at the outiying pH values of 5.0 and 8.0. In terms of different buffer systems (all at 
5 pH 6.5), an imidazole-based buffer demonstrated the highest relative activity as compared 
with buffers containing 50 mM MES, ADA, or phosphate. As expected, phosphate buffer 
was clearly inhibitory (Fig. 6, Panel (A) inset). 

We also examined 2-0 sulfatase activity relative to ionic composition. The 
recombinant enzyme was optimally active at approximately 50 mM NaCL Activity was 
10 sharply inhibited by [NaCl] exceeding 100 mM, with 50 % inhibition occurring at less than 
250 mM NaCl (Fig. 6, Panel (B)). Maximal enzyme activity was largely uuafifected by the 
addition of EDTA up to a 1 mM concentration. Addition of exogenous CaCl2, MgCla, or 
MnCk (up to 10 mM) also had no substantive effect, indicating that these particular divalent 
metal ions are not required. A preincubation of the enzyme with 5 mM EDTA did result in 
1 5 an approximately 1 0 % inhibition of activity using the trisulfated disaccharide as a substrate. 
37°C was the default temperature at which all of the preliminary biochemical 
experiments were conducted. We measured both relative enzyme activity and stability as a 
ftmction of varying reaction temperature (Fig. 6, Paenl (C)). The 2-0 sulfatase was active 
over a fairly broad temperature range (25*^0 to 37°C), with optimal activity occurring at 30^*0. 
20 Enzyme activity was compromised at 42°C. Enzyme stabihty at this temperature was 
likewise affected as assessed in pre-incubation experiments conducted at varying 
temperatures (30°C -> 42°C) prior to measuring 2-0 sulfatase activity at 30°C. 

The substrate-product relationship between the 2-0 sulfatase and A 4,5 gfycuronidase — ^As 
25 we have akeady noted, the flavobacterial A 4,5 glycuronidase is unable to hydrolyze 

imsaturated saccharides possessing a inonic acid 2-0 sulfate at the non-reducing end (Myette, 
J. R., Shriver, Z., Kiziltepe, T., McLean, M. W., Venkataraman, G., and Sasisekharan, IL 
(2002) 5zocAe/7Iw^ry 41(23), 7424-7434). We hypothesized that, an obUgatory substrate- " 
product relationship between the 2-0 sulfatase and the A 4,5 glycuronidase may exist. We 
30 examined a possible kinetic relationship between these two enzymes by looking at their 
sequential action (Fig. 7). Iq this experiment, A 4,5 glycuronidase activity was measured 
directiy either during or followiag the addition of the recombinant 2-0 sulfatase using the 
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disaccharide substrate AUzsHns- When this- disaccharide was incubated with the A 4,5 
eazyme alone, it was completely refractory to glycuronidase-mediated hydrolysis as 
measured by a loss of absorbance at 232 mn. A 2 minute preincubation of the substrate with 
the 2-0 sulfatase, however resulted in robust linear glycuronidase activity. This rate yras 
5 comparable to the rate of hydrolysis measured for the control substrate AUHns using the A 
4,5 enzyme alone. In the reciprocal experiment (i.e., whereby the 2-0 sulfatase was added 
second), we observed an initial lag in A 4,5 activity. This lag was followed by a linear A 4,5 
activity, albeit at a slower rate than in the case where the 2-0 sulfatase was added first. The 
observed delay in activity was presumably due to the prerequisite 2-0 desulfation of the 
10 substrate which must occur prior to being acted on by the glycuronidase. This experiment 
clearly demonstrates at least a functional linkage between these two HSGAG degrading 
enzymes. 

With the results just described, we considered the parallel use of these two enzymes 
(along with the heparinases) as complementary tools for HSGAG compositional analyses. 

15 The utility of this combinatorial approach is shown in Fig. 8. 200 |ig of heparin were first 
subj ected to an exhaustive heparinase treatment. Subsequent treatment of the cleavage 
products with the A 4,5 glycuronidase resulted in the disappearance of select saccharide 
peaks, namely those that did not possess a 2-0 sulfated uronic acid at the non-reducing end 
(Fig. 8, Panel (B)). Conversely,, subsequent treatment of the heparinase-derived saccharides 

2D with the 2-0 sulfatase results in both the disappearance of 2-0 sulfated disaccharides as well 
as a concomitant appearance of their desulfated products (Fig. 8, Panel (C)). When both the 
A 4,5 glycuronidase and the 2-0 sulfatase were added simultaneously to the heparinase 
cleavage products, essentially all of the saccharides were hydrolyzed by the A 4,5 
glycuronidase as evident by a lack of any UV absorbable electrophoresis products (Fig. 8 

25 Panel P). 

Stiucture-based homology modeling of the 2-0 sulfatase active site — The crystal structures 
of three sulfatases have been solved- These sulfatases are human arylsulfatase A (Lukatela, 
G., Krauss, N., Theis, fC., Selmer, T., Giesehnann, V., von Figura, K, and Saenger, W. 
3D (1 998) Biochemistry 37(1 1), 3654-64, von Bulow, R,, Schmidt, B., Dierks, T., von Figura, 
IL, and Uson, I. (20O1) JMol Biol 305(2), 269-77), arylsulfatase B (N-acetylgalactosamine- 
4-sulfatase) (Bond, C. S., Clements, P. R., Ashby, S. X, Collyer, C. A., Harrop, S. L, 
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Hopwood, J. J., and Guss, J, M. (1997) Structure 5(2), 277-89), and a bacterial arylsulfatase 
&om Pseudomonas aeruginosa (Boltes, L, Czapinska, H., Kahnert, A., von Bulow, R., 
Dierks, T., Schmidt, B., von Figura, KL, Kertesz, M. A., and Uson, L (2001) Structure (Camb) 
9(6), 483-91). In comparing their structures, we observed a structural homology between 
5 each of them, especially as it pertained to a conservation of critical active site residues and 
their spatial arrangement. By extension, most of these amino acids were likewise conserved 
in the flavobacterial 2-0 sulfatase as evident by a direct alignment of their primary sequences 
(Figs. 9 and 16). We used this close structural relationship to constmct three homology- 
based models for the flavobacterial 2-0 sulfatase, each one based on one of the three crystal 

10 structures examined. We ultimately chose as our representative 2-0 sulfatase structure the 
homology model constructed using the N-acetylgalactosamine-4-sulfatase (arylsulfatase B) 
(Fig. 10). This decision was largely based on it also being a GAG desulfating enzyme. In 
this model, we replaced cysteine 82 with a formylglycine (FGly 82). We chose to represent 
FGly 82 in the hydrated state as a geminal diol [-C^0H)2], consistent with the proposed 

15 resting state (before catalysis) of the enzyme (Lukatela, G., Krauss, N., Theis, K., Selmer, T., 
Giesehnann, V., von Figura, K., and Saenger, W. (1998) Biochemistry 37(11), 3654-64, 
Waldow, A., Schmidt, B., Dierks, T., von Bulow, R., and von Figura, K. (1999) J Biol Chem 
274(18), 12284-8). 

Upon inspection of the 2-0 sulfatase structure, several amino acids that potentially 
20 constitute the active site were identified (Table 3). There are several structurally conserved 
basic amino acids in the proximity of FGly 82 including Arg 86, Lys 134, ffis 136 and Lys 
308. The topology of the active site as observed in our structural model indicated that the 
critical FGly 82 and the basic amino acid cluster are located at the bottom of a deep pocket 
(Fig. 10, Panel (B)). Such restrictive access to the active site would appear to impose a clear 
25 structural constraint on the substrate as it relates to the position of the 2-0 sulfite group 
within the ohgosaccharide chain (i.e., externally vs. internally positioned) upon which the 
enzyme acts. We predicted from this topology that a sulfate group present at the non- 
reducing end of the oligosaccharide will be favorably positioned for catalysis; the 
juxtaposition of an internal sulfate iuto the active site would require a substantial bending of 
30 the oligosaccharide chain. Such chain distortion would be sterically unfavorable. Based on 
these constraints, therefore, we predicted the sulfatase to hydrolyze 2-0 sulfates in an 
exclusively exolytic fashion. This exclusivity for the non-reducing end does not necessarily 
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preclude, however, the enzyme acting on longer chain oligosaccharides (i.e., those exceeding , 
a disaccharide in length) provided that they in fact possess sulfates at the terminal 2-OH 
position. The model does suggest a likely kinetic preference for disaccharide suhstrates as 
they would most readily diffuse into and out of this narrow active site (see enzyme-substrate 
5 structural modeling below). 

Table 3: Structure-based comparison of sulfatase active site residues 



2-0 sulfatase 


AiylsuIfatascA 

Htamn 


ArylsnUataseB 

Hitman 


AryJsnlfatase 


Cy^l 


Cys»69 


Cya-91 








Ai^95 


Ai^-55 


hys-124 


L>'S-123 


Ly5.i4S 


Lys^lU 


Hls-136 


ms-125 


Hi5-147 


HI5-115 


Ly5-30S 


Lys-302 




Lys^37S 


Gln-237 


Hi5-229 


His-242 


Hi5*Zll 


Asp-42 


A5P-29 


Asik53 


Asp-13 




Asp-SO 


A5P-54 


Asp-14 


Asp-295 


Asp*281 


A£p-300 


Asp^l7 


Hi5^296 




A3B*30i 


Aso31& 


Lys-238* 




GIa-243 


Trp-2n 




G1a-lS3 


Ar^lSO 


Pco-161 




Hi5-151 


Sfir-^172 


Ala-U9 


Thr-1G4= 


Val-91 






Glu-106 


Val-93 


Trp-llS 








Pro'116 




6111-359^' 




Trp-319 


Ala-176 



Highly conserved amino acids are listed in black. Non-conserved anuno acids are listed in gray. Amino acids 
10 in the 2-0 sul&tase that could be potentially involved in substrate binding are noted by an asterisk. Stnictural 
alignment of the modeled 2-0 sulfetase stracture with the other sulfatases was obtained based on superposition 
of their Ca traces using the combinatorial extension algorithm (McLean, M. W., Biuce, J. S., Long, W. F., and 
Williamson, F. B. (1984) Eur JBiochem 145(3), 607-15). Regions of deletion in the stmctural alignment axe 
noted with a minus sign. 

15 

The surface of the active site pocket is comprised of many amino acids that can 
potentially interact with a disaccharide substrate. These include Lys 107, Lys 175, Lys 238, 
Ghi 237 and Gin 309, Thr 104, Glu 106 and Asp 159. Lysines and glutamines are commonly 
occinring amino acids in heparin binding sites that interact with the sulfate and carboxylate 
20 group. Unlike the amino acids proximal to the FGly 82, these residues are not conserved in 
the other sulfatases that we examined (Table 3, denoted in gray), suggesting a potentially 
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unique role of these amino acids in dictating oligosaccharide substrate specificity. This 
disparity is particularly true when directly comparing the 2-0 sulfatase and arylsulfatase A; 
many of the non^jonserved amino acids in the 2-0 sulfatase are charged while those in 
arylsulfatase A are predominantly hydrophobic. This observation is consistent with the 
5 structural distinction of their respective substrates, i.e., the highly sulfated HSGAG substrates 
of the 2-0 sulfatase vs. the long hydrophobic alkyl chains of cerebroside-3-sulfate substrate 
of arylsulfatase A. 

Enzyme-substrate structural complex: Interaction between 2-0 sulfatase and disaccharides — 
10 Since the active site can readily accommodate disaccharide substrates, we modeled several 
unsaturated glycosaminoglycan disaccharides. Our choice of A 4,5 unsaturated substrates 
was logical for two reasons: 1) jS-eliminative cleavage of a HS polysaccharide by the 
flavobacterial lyases that naturally occurs in vivo results in the formation of disaccharides 
and other small oligosaccharides all possessing a A4-5 unsaturated bond at the non-reducing 
15 end uronic acid and; 2) the obligatory substrate-product relationship between the 2-0 

sulfatase and the A 4,5 glycuronidase that exists botii in vitro and in vivo, A representative 
structural complex involving the trisulfated disaccharide AU2sHns,6s (Fig. 1 1) was used to 
describe the molecular interactions between the enzyme and the substrate. This choice was 
ultimately validated by the substrate kinetics. A description of these interactions and their 
20 proposed functional roles is shown in Table 4. The functional roles of the conserved active 
site amino acids (hsted in bold) were proposed based on their interactions with the 2-0 
sulfate group and/or the gemiaal diol of the formylglycine at position 82. Identical roles have 
been proposed for the corresponding amino acids in the three known sulfatase crystal 
structures (Table 3). 
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Table 4. Functional assignment of 2-p-sulfatase active site amino acids — 



Active siK 
Amliu) acids 


T>rnnfsed fnnctiDiistl rolfi 




Mcdiiied info hydrated fbnn of the FGly - Oyl posltioaed for nudeophllic attack oa 
sulfate ^tjp^ 


Ar£-S6,His-136 


Stabfliziiie &e hydrated FGIy by intEractiott mlh 0% His-136 is also portioned 
finroraljly for abstiactiDa of protoi flrom Oil after catalysis to eHminate the aulfate 
EToup and r^gerKrate eemiaal diol 


Lys-IH Lys-30S, Gln- 
237 


Coordinate with the oocygsn atoms of 2-0- sulfhte group to enhance elechoa deasity 
wlUidrawal fiom sulfate group thereby Increnstng the electoophlliclty of snlftir 
center. LyflOS is also positioned to protDiiat& the oxygen atom on the leaving 
substrate. 


Asp-295 


Enhances nudeophilidty of ty proton danatian , 


Lys.23S, Ly5-175 


Interactkm with planar carboijl gronp of AU may bs critical for suljstmte 
recognitian. and poationing the 2-0 sulfate gnmp. 


Thr-104,L>'S-107 


Interactlan with 6-0 sulfate on elucosamlne may he critical for positionins of 2-0 
sulfate ffovp. 


Leu-39a, Leu-391, Leu- 
392 


Better positioned to makB favorabile hydrophohi c contacts witli fte N-^cetyl group. 



The ammo acids listed in the first column were identified by inspection of the structural model presented in 
Fig. 3. The critical active site Cys-82 is indicated in boldface. 

5 

A closer inspection of the modeled enzyme-substrate complex revealed some 
interesting possibilities pertaining to the role of the non-conserved amino acids in substrate 
recognition and binding. The planar carboxylate group attached to the C5 atom of the A4-5 
uronic acid is oriented in such a manner as to potentially interact withLys 175, Lys 238. 

10 These amino acids could play an important role, therefore, in favorably orienting the 2-0 
sulfate within the active site. We were further interested in this arrangement given the 
additional constraint imposed upon the planar carboxyl group of the uronic acid by the 
presence of the C4-C5 double bond. This constraint may further influence substrate 
orientation within the active site. Given this possibiUty, we predicted a substrate 

15 discrimination exhibited by the 2-0 sulfatase which is based on the presence of the A 4,5 
double bond at the oligosaccharide non-reducing terminus. In the absence of this double 
bond, the favorable orientation of the 2-0 sulfate and the C5 carboxylate afforded by charge 
interactions with Lys 178 and Lys 238, respectively, would not occur. 

To better understand this likely stmctural constraint, we superimposed onto our 

20 trisulfated model substrate disaccharides containing a non-reducing end iduronic acid in 

either the ^Ca or ^So conformation. The superimposition was such that the S-0-C2-C1 atoms 
of all the uronic acids coincided, thereby fixing the orientation of the 2-0 sulfate group, hi 
this model, the carboxylate groups of the iduronic acid containing disaccharide substrates 



. . ^^immmi^u WO20C(^2592 _ . : - . . .PCTAJS2004/000332^^i1SWmE>i. 7 WO 201 

-75- 

were, in fact, pointing away from the.active site pocket and were not positioned to interact as 
favorably with the active site amino acids (i.e., Lys 175, Lys 238) as compared with the 
original disaccharide substrate possessing a planar C5 carboxylate. 

Our structural model of a sulfatase-trisulfated disaccharide complex also points out 
5 key interactions involving additional sulfates (other than the uronic acid 2-OH position) 
present on the adjoining glucosamine, hi particular, the 6-0 sulfate group interacts with the 
basic side chain of Lys 107 within the enzyme active site (Fig. 1 1). This putative charge 
interaction would likely play an important role in stabilizing the orientation of the substrate in 
the active site. In contrast, the N-sulfate group of the disaccharide glucosamine is proximal 
10 to a contiguous stretch of leucines (390-392). In such an arrangement, it is the methyl group 
of an N-acetylated glucosamine rather than a sulfate at this position which is more likely to 
make favorable hydrophobic contacts with these residues. This prediction was borne out in 
one of our models docking the AU2sHnac,6S substrate in the active site. 

We also modeled enzyme-substrate complexes containing two imsaturated 
15 chondroitin sulfate disaccharides (AU2sGalNAc,4S and AU2sGalNAc.6s). hi comparison to our 
original model using the heparin disaccharide substrate, we foimd interactions with the 2-0 
sulfate and carboxyl group of the AU monosaccharide that were identical to that of 
AU2sHns,6S. There were few mteractions involving the 4-sulfate and 6-sulfate groups, 
however. This particular model, therefore, does not exclude the abihty of the so-called 
20 "heparin/heparan sulfate" 2-0 sulfatase to hydrolyze 2-0 sulfated chondroitin disaccharides. 
Given a lack of additional favorable contacts between the enzyme and substrate (e.g., with 
either the 4-0 or 6-0 sulfates), we would anticipate a lower catalytic efficiency for the 
chondroitin disaccharides relative to the structurally corresponding heparin disaccharides. 

In discussiug this model, we must briefly consider the potential role of divalent metal 
25 ions. We decided not to include any such metal ions in our model of the 2-0 sulfatase as we 
could find no divalent metal requirement for enzymatic activity. A divalent metal ion is 
present, however, in all three sulfatase crystal structures that we examined. In each case, the 
metal ion coordinates with the oxygen atoms of the sulfate group of the respective substrate. 
Additionally, a cluster of four highly cons^ved acidic amino acids has been observed to 
30 coordinate with this divaloit metal ion. In the case of human arylsulfatase B, for example, 
the oxygen atoms of Asp 53, Asp 54, Asp 300 and Asn 301 are coordinated with a Ca ion. 
Three of the four corresponduig amino acids in the flavobacterial sulfatase model that we 
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have identified as potentially coordinating with a metal ion are Asp 42, Ghi 43 and Asp 295 
(Table 3). The fourth amino acid in the 2-0 snlfatase corresponding spatially to Asn 301 of 
axylsulfatase B is His 296. The positive charge of this position, however, does not favor the 
proximal location of a divalent metal cation. It is perh^s this unfavorable charge interaction 
5 which interferes with proper metal ion coordination. 

Enzyme-substrate model: Mechanism for catalysis — Nearly identical mechanisms for the 
hydrolysis of the sulfate ester bond involving the conserved active site amino acids have been 
proposed for human arylsulfatases A and B and the bacterial sulfatase from Pseudomonas 

10 aeruginosa. The resting state of the active sulfatase in each of the crystal structures is 

proposed to contain the gemiaal diol which is stabihzed by iateractions with basic residues. 
His 136 and Arg 86 of the flavobacterial enzyme are positioned appropriately in the active 
site to do so (Fig. 10, Panel (B)). A critical step in catalysis involves the correct positioniag 
the 2-0 sulfate group such that the sulfur atom is accessible to the O7I of the geminal diol. 

15 We have already described how interactions of specific active site amino acids with the 
planar carboxyl group of the mronic acid (Lys 175, Lys 238), with the 6-0 sulfate of the 
glucosarcune (e.g., Lys 107 and possibly Thr 104) and with the 2-0 sulfate itself (Lys 134, 
Lys 308) are likely to serve in this capacity (Table 4). At the same time, intemction of the 2- 
O sulfate group with charged amino acids would also enhance any electron density 

20 withdrawal from the oxygen atoms, thereby increasing the electrophiUcity of the sulftu: 

center. It has also been suggested that the nucleophiUcity of the Oyl atom is enhanced by a 
possible proton donation to a neighboring aspartic acid residue. In our structtral model of the 
2-0 sulfatase, this residue would correspond to Asp 295. 

An Sn2 mechanism may follow the above steps and eventually lead to the cleavage of 

25 the sulfate ester bond. In this mechanism, the exocyclic oxygen atom on the leaving substrate 
may be protonated by water or potentially by neighboring amino acids. In the 2-0 sulfatase 
active site model, Lys 308 is juxtaposed to protonate the leaving group (Fig. 11). The 
resulting sulfate group on the geminal diol is subsequently eUminated by abstraction of a 
proton from 0^2 regenerating the formylglycme. His 136 is positioned to abstract this 

30 proton. 

As we have aheady pointed out, our homology-based model of the 2-0 sulfatase has 
several structure-function imphcations relating to substrate specificity. Many of these points 
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are summarized in Table 4. When exanained from the perspective of oUgosaccharide 
structure, our model addresses the issue of substrate specificity principally as it relates to the 
following parameters: 1) the exolytic action of the enzyme; 2) the influence of 
oligosaccharide chain length; 3) the presumed requirement for an unsaturated double bond at 
5 the non-reducing end; 4) the number and position of additional sulfates present within the 
glucosamine adjoining the 2-0 sulfated uronic acid and; 5) the nature of the glycosidic 
hnkage position between these two monosaccharides, hi the example which follows, each of 
these predictions is empirically examined through biochemical and kinetic studies defining 
substrate preference. 

10 

Exolytic Action of the 2-0 sulfatase — ^We addressed this important question using as a 
substrate the purified heparin-derived AT- 10 decasaccharide 

AU2sHns,6sI2sHms,6sI2sHns,6sIHnac,6sGHns,3S,6S. This oUgosaccharide possesses a A 4,5. 
unsaturated uronic acid at the non-reducing end and both externally and internally positioned 
15 2-0 sulfates. The substrate was first exhaustively treated with the 2-0 sulfatase. The 2-0 
desulfated decasaccharide was then subjected to an exhaustive heparinase treatment. CE- 
based compositional analyses indicated the disappearance of the disaccharide AU2sHns.6S by 
only one-third; two-thirds of this trisulfated disaccharide remained after sequential treatment 
with the 2-0 sulfatase and heparin lyases (Fig. 12). Loss of a single sulfate was 
20 independently detennined by mass spectrometry. The loss of the single sulfate to the 

terminal 2-OH position is suggested given the fact that the internally positioned iduronic acid 
2-0 sulfates are structurally identical and should therefore possess the same potential for 
desulfation. Based on this assumption, the 2-0 sulfatase would appear to act in an exolytic 
fashion. Our model clearly predicts a strong preference for sulfates positioned at the non- 
25 reducing end where these sulfates would not be constrained by the narrow topology of the 
enzyme active site. 

Tlie requirement for an unsaturated A 4,5 non-reducing terminus — ^In a related experiment, 
we assessed the abihty of the 2-0 sulfatase to hydrolyze size-fractionated hexasaccharides 
30 derived from the nitrous acid treatment of hepariiL Unlike enzymatic cleavage, these 
chemically-derived heparin saccharides do not possess a A 4,5 unsaturated bond at their 
respective non-reducing ends. A majority of the resultant tetrasaccharides, however, do 
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contaiB an ts at this end Using MALDI-MS, we were unable to detect any enzyme- 
dependent desulfation of treated hexasaccharides. This result strongly suggests a structural 
requirement for the A 4,5 bond. The rationale for this is described above in relation to our 
molecular modeling. In particular, the physical connection between this bond and the planar 
5 C5 carboxylate of the uronic acid carboxylate and how such a constraint permits critical 
enzyme-substrate interactions for the proper orientation of the 2-0 sulfate within the enzyme 
active site was described. 

Determination of disaccharide substrate kinetics and specificity— We were interested in 

10 ascertaining any kinetic discrimination the enzyme may possess for its disaccharide 

substrates based on the following stmctural considerations. 1) the number and position of 
' sulfates on the adjoining hexosamine; 2) the glycosidic linkage position (i.e., jSl-^ 4 versus 
al-^ 3); and 3) glucosamine vs. galactosamine as the adjoining hexosamine. We examined 
substrate saturation kinetics measured under Michaelis-Menten conditions. For these 

1 5 experiments, several heparin disaccharide substrates were used, each with a uronic acid 

possessing a 2-0 sulfate and a A 4,5 unsaturated bond at the non-reducing end, but differing 
in the degree of sulfation within the glucosamine. In addition the two unsaturated 
chondroitin disaccharides AU2sGalNAc,4s and AU2sGalKAc,6s were also examined as possible 
substrates. These latter two disaccharides differ from those derived from heparin/heparan 

20 sulfate in possessing a j81-> 3 glycosidic hnkage and a galactosamine in place of a 
glucosamine. The results are sunmiarized in Fig, 13 and Table 5. All of the heparin 
disaccharides examined were hydrolyzed at substantial rates that included kcat values which 
varied from approximately 600 to 1700 sec'^ At the same time, the 2-0 sulfatase did exhibit 
a substrate discrimination apparently based on the extent of sulfation and largely manifested 

25 as a Km effect. In particular, the presence of a 6-0 sulfate on the adjoining glucosamine 

conferred a significantly lower relative to its counterpart lacking such a sulfate ester. In 
tenns of catalytic efficiency, the trisulfated disaccharide (AU2sHns,6s) was clearly the 
preferred substrate whereas the mono-sulfated disaccharide (AU2sHnac) was least prefenred. 
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Table 5. 2-O-sulfatase disaccharide substrate specificity 



Disacctaiulde 


k«. fsec"^) 


KbCioM) 






1672 


0.515 


3247 


AU2sHNS.fiS 


814 


0.C87 


9356 


AiUjsHns 


911 


1,06 


859 


AlfesHltBe 


673 


4.66 


144 


AU2sGaW6S* 


<I00 


>I0. 





Kinetic parameters were derived from a non-linear regressional analyses of substrate saturation data depicted in 
Fig- 13. *Kinetic values for the unsaturated chondroitin disaccharide were approximated from double reciprocal 
5 plots. N.D. not determined. 

The 2-0 siilfated chondroitiii disaccharide AU2sGal>fAc,6S was only neglibly 
hydrolyzed under the same kinetic conditions. The enzyme did desnlfate this disaccharide to 
an appreciable extent, however, under reaction conditions involving a 4X higher enzyme 

10 concentration and a longer incubation time. Under these conditions, approximately 40%, of 
the substrate was desulfated over a 20 minute period. Li contrast, less than 10% of 
chondroitin disaccharide AU2sGalNAc,4S was hydrolyzed diuing the same time period. To 
determine whether either or both of these 2-0 sulfated chondroitin disaccharides could be 
quantitatively desulfated under exhaustive conditions, we carried out an 18 hour incubation at 

15 30°C that included 5 mM of substrate and 5 jxM enzyme. Under these conditions, both 

chondroitin disaccharides were greater than 95% desulfated at the 2-0 position. This result 
indicates that while hnkage position and/or hexosamine isomerization are discriminating 
kinetic factors, these physical parameters are not absolute determinants for 2-0 sulfatase 
substrate recognition. It is interesting to consider this latter observation in the context of the 

20 lysosomal pathway for glycosaminoglycan degradation hi mammals where one enzyme 
desulfates both chondroitin and HS ohgosaccharides at this position. 

The ^parent kinetic discrimination described above points to an underlying structural 
determinant, namely a preference for glucosamine sulfated at the 6-OH and 2N positions. 
Our model does predict a favorable interaction with the 6-0 sulfate in correct optimal 

25 orientation. At the same time, we would predict a bias in favor of acetylation of the N- 
position rather than sulfation due to potential hydrophobic interactions. 

2-0 sulfatase peptide mapping and chemical modification of active site formylglycine — 
Finally, in describing the structure-function relationship of the 2-0 sulfatase active site, we 
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come to the central catalytic player itself— the fonnylglycine at position 82. The 
Tecombinant expression of catalytically active 2-0 sulfatase in E, coli functionally argues for 
this covalent modification of the active site in vivo. ,We established the catalytic function of 
Cys 82 by site-directed mutagenesis. The mutant (C82A) was recombinantly expressed and 
5 purified as a histidine-tagged protein in the same manner employed for the wild-type enzyme. 
Comparable expression levels of soluble protein were achieved. The C82A mutant, however, 
was completely inactive. Both the wild-type and mutant possessed the same secondary 
stnicture as exhibited by their virtually superimposible CD spectra (Fig. 14), argumg against 
any adverse global confoimational changes induced by the molecular replacement of the 

1 0 cysteine by alanine. 

We also set out to demonstrate the physical presence of the FGly at position 82 by the 
tandem use of protein chemistry and mass spectrometry. 10 nanomoles of wild-type sulfatase 
(2-0 AM^'^"^) and the C82A mutant were reacted with Texas red hydrazide (620.74 Da) as 
described in Materials and Methods, The two sulfatase firactions were subsequently 

15 trypsinized under mildly denaturing conditions followed by reductive methylation of the 
unmodified cysteines. The molecular masses of the resultant peptides were determined by 
MALDI-MS (Fig. 15). In this experiment, we identified a single ionized species uniquely 
present in the labeled sulfatase experiment (Fig. 15, Panel (B)), but absent in the active site 
mutant (Fig. 15, Panel (C)) or in the unlabeled control (Fig. 15, Panel (A)). The empirical 

20 mass of this species conresponded most closely to the peptide sequence 

FTRAYCAQPLCTPSR (SEQ ID NO: 37) resultant firom a partial trypsin cleavage. This 
peptide contains the sulfatase consensus sequence CXPXR which includes the critical active 
site cysteine (denoted in bold) at position 82. The mass of this peptide is consistent with first 
the conversion of this cysteine to a formylglycme (FGly 82) followed by the covalent 

25 hydrazone hnkage of the aldehyde-reactive fluorophore at this position. It also takes into 
account the carbamidomethylation of the second (unmodified) cysteine present in this 
peptide. These data, taken together with the loss of function observed for the C82A mutant, 
establish the important stmcture-function relatiotiship for this active site modification. 

Each of the foregoing patents, patent applications and references that are recited in 

30 this application are herein incorporated in flieir entirety by reference. Having described the 
presently prefenred embodiments, and in accordance with the present invention, it is beheved 
that other modifications, variations and changes will be suggested to those skilled in the art in 
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view of the teachings set forth herein. It is, therefore, to be understood that all such 
variations, modifications, and changes are believed to fall within the scope of the present 
invention as defined by the appended claims. 
We claim: 



>Xt ' 'i^^^Ql _WO 2004/062592. ' .x. \ .«MWm^A>^ PCTAJS20tf4/000332 



-82- 

CLAIMS 

L A composition comprising 2-0 sulfatase and a phaimaceutically acceptable carrier. 

5 2. The composition of claim 1 , wherein the 2-0 sulfatase is produced by expressing 

an isolated nucleic acid molecule selected from the group consisting of: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic 
acid molecule having a nucleotide sequence selected from the group consisting of nucleotide 
sequences set forth as SEQ ID NOs: 1 and 3, and which code for a 2-0 sulfatase, 
10 (b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 

sequence due to degeneracy of the genetic code, and 
(c) complements of (a) or (b). 

3. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid 
15 molecule comprises a nucleic acid sequence as set forth as SEQ ID NO: 3. 

4. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid 
molecule codes for SEQ ID NO: 4. 

20 5. The composition of claim 1 , whereiu the 2-0 sulfatase is produced by expressing 

an isolated nucleic acid molecule comprising a nucleotide sequence that is at least about 90% 
identical to a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1 and 
3. 

25 6. The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 

comprises a nucleotide sequence that is at least about 95% identical. 

7. The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence that is at least about 97% identical. 

30 

8. The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence that is at least about 98% identical. 
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9. The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence that is at least about 99% identical. 

10. The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence that is at least about 99.5% identical. 

1 1 . The isolated nucleic acid molecule of claim 5, wherein the nucleic acid molecule 
comprises a nucleotide sequence that is at least about 99.9% identic 

12. The composition of claim 1, wherein the 2-0 sulfatase is produced 
recombraantly. 

13. The composition of claim 12, wherein the 2-0 sulfatase is recombinantly 
expressed in £^ coli. 

14. A composition comprisiag an expression vector comprising 

the isolated nucleic acid molecule of any one of claims 2-1 1 operably linked to a 
promoter, and 

a phaimaceutically acceptable carrier. 

15. A composition comprisiag a host cell comprising the expression vector of claim 
14 and a pharmaceutically acceptable carrier. 

16. The composition of claim 1, wherein the 2-0 sulfatase has an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 2 and 4 or a functional variant 
thereof. 

17. The composition of claim 16, wherein the 2-0 sulfatase has the amino acid 
sequence as set forth in SEQ ID NO: 4. 

18. The composition of of claim 16 or 17, wherein the amino acid sequence contains: 
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a) a residue selected from the group consisting of Arg 86, Asp 42, Asp 159, Asp 295, 
Cys 82, FGly 82, Gin 43, Gin 237, Glu 106, Gin 309, ffis 136, ffis 296, Leu 390, Leu 391, 
Leu 392, Lys 107, Lys 134, Lys 175, Lys 238, Lys 308 and Thi 104 and 

b) at least one amino acid substitution. 

19. The composition of claim 18, wherein the amino acid sequence of the 2-0 
sulfatase contains a Cys 82 residue. 

20. The composition of claim 19, wherein the Cys 82 residue is modified to formyl 
glycine. 

21. The composition of claim 18, wherein the amino acid sequence of the 2-0 
sulfatase contains a FGly 82 residue. 

22. The The composition of claim 16, wherein the 2-0 sulfatase is synthetic. 

23. A pharmaceutical preparation, comprising: 

a degraded glycosaminoglycan produced by contacting a glycosaminoglycan with a 2- 
O sulfatase and a pharmaceutically acceptable carrier. 

24. The method of claim 23, wherein the degraded glycosaminoglycan was produced 
by also contacting the glycosanunoglycan with at least one other glycosaminoglycan 
degrading enzyme. 

25. The method of claim 24, wherein the glycosaminoglycan is contacted with the at 
least one other glycosaminoglycan degrading enzyme concomitantly with the 2-0 sulfatase. 

26. The method of claim 24, wherein the at least one other glycosaminoglycan 
degrading enzyme is heparinase orglycuronidase. 

27. A method of inhibiting angiogenesis, comprising: 
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adinimstering to a subject in need thereof an ejBfective amount of a composition or 
pharmaceutical preparation selected from the group consisting of a) a composition of any one 
of claims 1, 16 or 1 8 and b) the pharmaceutical preparation of claim 23 for inhibiting 
angiogenesis. 

28. A method of treating cancer, comprising: 

administering to a subject in need thereof an effective amount of a composition or 
phamiaceutical preparation selected from the group consisting of a) a composition of any one 
of claims 1, 16 or 18 and b) the pharmaceutical preparation of claim 23 for treating cancer. 

29. A method of inhibiting cellular proliferation, comprising: 
administering to a subject in need thereof an effective amount of a composition or 

pharmaceutical preparation selected from the group consisting of a) a composition of any one 
of claims 1, 16 or 18 and b) the pharmaceutical preparation of claim 23 for inhibiting cellular 
proliferation. 

30. A method of treating neurodegenerative disease, comprising: 
administering to a subject in need thereof an effective amount of a composition. or 

pharmaceutical preparation selected from the group consisting of a) a composition of any one 
20 of claims 1 , 1 6 or 1 8 and b) the pharmaceutical preparation of claim 23 for treating 
neurodegenerative disease. 

31. The method of claim 30, wherein the neurodegenerative disease is Alzheimer's 
disease. 

25 

32. A method of treating atherosclerosis, comprising: 

administering to a subject in need thereof an effective amount of a composition or 
pharmaceutical preparation selected from the group consisting of a) a composition of any one 
of claims 1, 16 or 18 and b) the phamaaceutical prq)aration of claim 23 for treating 
30 atherosclerosis. 

33. A method of treating or preventing microbial infection, comprising: 
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admimstering to a subj ect in need thereof an effective amount of a composition or 
pharmaceutical preparation selected from the group consisting of a) a composition of any one 
of claims 1, 16 or 18 and b) the pharmaceutical preparation of claim 23 for treating or 
preventing microbial infection. 

34. A composition comprising a 2-0 sulfatase and a pharmaceutically acceptable 
carrier, wherein the 2-0 sulfatase contains at least one amino acid residue that has been 
substituted with a different amino acid than in native 2-0 sulfatase and wherein the residue 
that has been substituted is selected from the group consisting of Arg 86, Asp 42, Asp 159, 
Asp 295, Gin 43, GM 237, Glu 106, Gin 309, ffis 136, ffis 296, Leu 390, Leu 391, Leu 392, 
Lys 107, Lys 134, Lys 175, Lys 238, Lys 308 and Thr 104. 

35. The composition of claim 34, wherein the 2-0 sulfatase has a higher specific 
activity than native 2-0 sulfatase. 
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<211> 1395 
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<400> 1 

atgaagatgt acaaatcgaa aggctggttg atagccatgc ttatacttgc aggttttgga 60 

gatgcagggg cgcaaacctc aaaagtagca gcttccaggc ctaacatcat tatcatcatg 120 

acagatcagc aaacagctga tgccatgagc aatgctggta ataaggacct gcatacacct 180 

gcaatggatg ttttggctgc aaacggtacc cgttttacac gtgcctattg tgcccagccg 240 

ctctgtacac cttcacgctc cgcgatattt agcggaaaaa tgccacatga aaccggcttt 300. 

acggggaata caccggaaaa ggacggacag tggcccgatt ctgtgctgat gatgggcaaa 360- 

atatttaagg caggaggcta taaaaccggc tacgtcggaa aatggcacct gcctgttcct 420 

gttactaaag tagcacaaca tggatttgag actattgaga atacaggtat gggcgattat 480 

accgatgcag ttaccccatc gcaatgcgcc aacttcaata aaaagaataa agacaaccca 540 

tttttactgg tagcatcctt tttgaaccca cacgatattt gtgaatgggc aaggggtgat 600 

aatttgaaaa tggatgttct ggatgcagcg ccggatacag cattttgtcc gaaattacct 660 

gccaactggc caattccggc ttttgagcct gccattgtaa gggaacagca aaaggtgaac 720 

ccgcgtactt atccttcggt aggctggaac gaaagccagt ggcgcaaata ccgctgggcc 780 

tataaccgcc tggtagagaa ggtagacaat tatatggcca tggtattggg ttcgttaaaa 840 

aaatatggta tagaagacaa taccatcatc atctttacca gcgatcatgg tgatggttat 900 

gcggcacatg agtggaacca gaagcagatt ttgtatgagg aggctgccag gatacctttt 960 

atcatctcga agatcggaca atggaaagcc agaaccgatg atcagctggt ttgcaatggc 1020 

atcgatatta tccccaccat atgtggcttt gccggaattg ctaaacctgt tggtttaaaa 1080 
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ggcctggatt taagtaaacg tattgccaac ccttcggtta aactacggga tactttagtg 1140 

atagaaaccg attttgctga taacgaactg ttgctgggta ttaagggcag ggcagtgatt 1200 

accaaagatt ttaaatacat tgtttatgac aagggggaga tccgggaaca attgtttgac 1260 

ctggaaaaag acgcaggaga aatggataac ctggctgtta aacccgccta taaaaagaaa 1320 

ttgaatgaaa tgcgcgctta cctgaaacta tggtgtaaac agcaccagga ttcgttttat 1380 

gcattaaaaa aataa 13 95 
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Thr 


He 


Glu 


Asn 


Thr Gly 
155 


Met 


Gly Asp Tyr 
160 


Thr 


Asp Ala 


Val 


Thr 
165 


Pro 


Ser 


Gin Cys 


Ala 
170 


Asn Phe 


Asn 


Lys Lys Asn 
175 


Lys 


Asp Asn 


Pro 
180 


Phe 


Leu 


Leu 


Val 


Ala 
185 


Ser 


Phe Leu 


Asn 


Pro His Asp 
190 


He 


Cys Glu 
195 


Trp 


Ala 


Arg 


Gly 


Asp Asn 
200 


Leu 


Lys Met 


Asp 
205 


Val Leu Asp 


Ala 


Ala Pro 
210 


Asp 


Thr 


Ala 


Phe 
215 


Cys 


Pro 


Lys 


Leu Pro 
220 


Ala 


Asn Trp Pro 
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lie Pro Ala Phe Glu Pro Ala He Val Arg Glu Gin Gin Lys Val Asn 
225 230 235 240 

Pro Arg Thr Tyr Pro Ser Val Gly Trp Asn Glu Ser Gin Trp Arg Lys 
245 250 255 

Tyr Arg Trp Ala Tyr Asn Arg Leu Val Glu Lys Val Asp Asn Tyr Met 
260 265 270 

Ala Met Val Leu Gly Ser Leu Lys Lys Tyr Gly He Glu Asp Asn Thr 
275 , 280 285 

He He He Phe Thr Ser Asp His Gly Asp Gly Tyr Ala Ala His Glu 
290 295 300 

Trp Asn Gin Lys Gin He Leu Tyr Glu Glu Ala Ala Arg He Pro Phe 
305 310 315 320 

He He Ser Lys He Gly Gin Trp Lys Ala Arg Thr Asp Asp Gin Leu 
325 330 335 

Val Cys Asn Gly He Asp He He Pro Thr He Cys Gly Phe Ala Gly 
340 345 350 

He Ala Lys Pro Val Gly Leu Lys Gly Leu Asp Leu Ser Lys Arg He 
355 360 365 

Ala Asn Pro Ser Val Lys Leu Arg Asp Thr Leu Val He Glu Thr Asp 
f 370 375 380 

Phe Ala Asp Asn Glu Leu Leu Leu Gly He Lys Gly Arg Ala Val He 
385 390 395 400 

Thr Lys Asp Phe Lys Tyr He Val Tyr Asp Lys Gly Glu He Arg Glu 
405 410 415 

Gin Leu Phe Asp Leu Glu Lys Asp Ala Gly Glu Met Asp Asn Leu Ala 
420 425 430 

Val Lys Pro Ala Tyr Lys Lys Lys Leu Asn Glu Met Arg Ala Tyr Leu 
435 440 445 

Lys Leu Trp Cys Lys Gin His Gin Asp Ser Phe Tyr Ala Leu Lys Lys 
450 455 460 



<210> 3 
<211> 1323 
<212> DNA 

<213> Flavobacterium heparinum 
<400> 3 

caaacctcaa aagtagcagc ttccaggcct aacatcatta tcatcatgac agatcagcaa 60 
acagctgatg ccatgagcaa tgctggtaat aaggacctgc atacacctgc aatggatgtt 120 
ttggctgcaa acggtacccg ttttacacgt gcctattgtg cccagccgct ctgtacacct 180 
tcacgctccg cgatatttag cggaaaaatg ccacatgaaa ccggctttac ggggaataca 240* 
ccggaaaagg acggacagtg gcccgattct gtgctgatga tgggcaaaat atttaaggca 300 
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ggaggctata aaaccggcta cgtcggaaaa tggcacctgc ctgttcctgt tactaaagta 



3 60 



gcacaacatg gatttgagac tattgagaat acaggtatgg gcgattatac cgatgcagtt 
accccatcgc aatgcgccaa cttcaataaa aagaataaag acaacccatt tttactggta 



420 
4 80 



gcatcctttt tgaacccaca cgatatttgt gaatgggcaa ggggtgataa tttgaaaatg 540 

gatgttctgg atgcagcgcc ggatacagca ttttgtccga aattacctgc caactggcca 600 

attccggctt ttgagcctgc cattgtaagg gaacagcaaa aggtgaaccc gcgtacttat 660 

ccttcggtag gctggaacga aagccagtgg cgcaaatacc gctgggccta taaccgcctg 72 0 

gtagagaagg tagacaatta tatggccatg gtattgggtt cgttaaaaaa atatggtata 780 

gaagacaata ccatcatcat ctttaccagc gatcatggtg atggttatgc ggcacatgag B40 

tggaaccaga agcagatttt gtatgaggag gctgccagga taccttttat catctcgaag 900 

atcggacaat ggaaagccag aaccgatgat cagctggttt gcaatggcat cgatattatc 960 

cccaccatat gtggctttgc cggaattgct aaacctgttg gtttaaaagg cctggattta 102 0 

agtaaacgta ttgccaaccc ttcggttaaa ctacgggata ctttagtgat agaaaccgafc 108 0 

tttgctgata acgaactgtt gctgggtatt aagggcaggg cagtgattac caaagatttfc 114 0 

aaatacattg tttatgacaa gggggagatc cgggaacaat tgtttgacct ggaaaaagac 1200 

gcaggagaaa tggataacct ggctgttaaa cccgcctata aaaagaaatt: gaatgaaatg 1260 

cgcgcttacc tgaaactatg gtgtaaacag caccaggatt cgttttatgc attaaaaaaa 132 0 



<210> 4 
<211> 440 
<212> PRT 

<213> Flavobacterium heparinum 
<400> 4 

Gin Thr Ser Lys Val Ala Ala Ser Arg Pro Asn lie lie lie lie Met 
15 10 15 

Thr Asp Gin Gin Thr Ala Asp Ala Met Ser Asn Ala Gly Asn Lys Asp 
20 25 30 

Leu His Thr Pro Ala Met Asp Val Leu Ala Ala Asn Gly Thr Arg Phe 
35 40 45 

Thr Arg Ala Tyr Cys Ala Gin Pro Leu Cys Thr Pro Ser Arg Ser Ala 
50 55 60 

rie Phe Ser Gly Lys Met Pro His Glu Thr Gly Phe Thr Gly Asn Thr 
65 70 75 80 

Pro Glu Lys Asp Gly Gin Trp Pro Asp Ser Val Leu Met Met Gly Lys 
85 90 95 

lie Phe Lys Ala Gly Gly Tyr Lys Thr Gly Tyr Val Gly Lys Trp His 



taa 



1323 
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Leu Pro Val Pro Val Thr Lys Val Ala Gin His Gly Phe Glu Thr He 
115 120 125 

Glu Asn Thr Gly Met Gly Asp Tyr Thr Asp Ala Val Thr Pro Ser Gin 
130 135 140 

Cys Ala Asn Phe Asn Lys Lys Asn Lys Asp Asn Pro Phe Leu Leu Val 

150 155 

Ala Ser Phe Leu Asn Pro His Asp He Cys Glu Trp Ala Arg Gly Asp 
155 170 175 

Asn Leu hys} Met Asp Val Leu Asp Ala Ala Pro Asp Thr Ala Phe Cys 
180 185 190 

Pro Lys Leu Pro Ala Asn Trp Pro He Pro Ala Phe Glu Pro Ala He 
195 200 205 

Val Arg Glu Gin Gin Lys Val Asn Pro Arg Thr Tyr Pro Ser Val Gly 
210 215 220 

Trp Asn Glu Ser Gin Trp Arg Lys Tyr Arg Trp Ala Tyr Asn Arg Leu 
225 230 235 240 

Val Glu Lys Val Asp Asn Tyr Met Ala Met Val Leu Gly Ser Leu Lys 
245 250 255 

Lys Tyr Gly He Glu Asp Asn Thr He He He Phe Thr Ser Asp His 
260 265 270 

Gly Asp Gly Tyr Ala Ala His Glu Trp Asn Gin Lys Gin He Leu Tyr 
275 280 285 

Glu Glu Ala Ala Arg He Pro Phe He He Ser Lys He Gly Gin Trp 
290 295 300 

Lys Ala Arg Thr Asp Asp Gin Leu Val Cys Asn Gly He Asp He He 
305 310 315 320 

Pro Thr He Cys Gly Phe Ala Gly He Ala Lys Pro Val Gly Leu Lys 
325 330 335 

Gly Leu Asp Leu Ser Lys Arg He Ala Asn Pro Ser Val Lys Leu Arg 
340 345 350 

Asp Thr Leu Val He Glu Thr Asp Phe Ala Asp Asn Glu Leu Leu Leu 
355 360 365 

Gly He Lys Gly Arg Ala Val lie Thr Lys Asp Phe Lys Tyr He Val 
370 375 380 

Tyr Asp Lys Gly Glu He Arg Glu Gin Leu Phe Asp Leu Glu Lys Asp 
385 390 395 4OO 

Ala Gly Glu Met Asp Asn Leu Ala Val Lys Pro Ala Tyr Lys Lys Lys 
405 410 415 

Leu Asn Glu Met Arg Ala Tyr Leu Lys Leu Trp Cys Lys Gin His Gin 
420 425 430 
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Asp Ser Phe Tyr Ala Leu Lys Lys 
435 440 



<210> 5 

<211> 11 

<212> PRT 

<213> Artificial Seqpience 
<220> 

<223> Sulfatase Consensus Sequence 
<220> 

<221> MISC_FEATURE 

<222> (2).. (2) 

<223> Xaa=any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (4)., (4) 

<223> Xaa=any amino acid 

<220> 

< 2 2 1 > MI SC^FEATURE 

<222> (6).. (9) 

<223> Xaa=any amino acid 

<220> 

<221> MISC_FSATURE 

<222> (10).. (10) 

<223> Xaa=serine or threonine 

<400> 5 

Cys Xaa Pro Xaa Arg Xaa Xaa Xaa Xaa Xaa Gly 
15 10 



<210> 6 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Sulfatase Consensus Sequence 
<220> 

<221> MISC_FEATURE 

<222> (1) . - (1) 

<223> Xaa=cysteine or serine 

<220> 

<221> MIS COFEATURE 

<222> (2) . - (2) 

<223> Xaa=any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (4) . . (4) 

<223> Xaa=any amino acid 
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<220> 

<221> MISC_FEATURE 

<222> (6) . . (9) 

<223> Xaa=any amino acid 

<220> 

<221> MISC_FEATaRE 

<222> (10) . . (10) 

<223> Xaa=serine or threonine 

<400> 6 

Xaa Xaa Pro Xaa Arg Xaa Xaa Xaa Xaa Xaa Gly 
15 10 



<210> 7 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Putative Consensus Sequence 
<220> 

<221> MISC_FEATURE 

<222> (5) . . (5) 

<223> Xaa=any hydrophobic amino acid 

<400> 7 

Gly Lys Trp His Xaa 
1 5 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



misc_feature 
(15).. (15) 
n is a, c, g. 



tni s c_f ea t \ir e 
(18) . . (18) 
n is a, c, g, 



or t 



or t 



<400> 8 

athgayatha thccnacnat h 



<210> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<220> 

<221> misc_f eature 

<222> (4).. (4) 

<223.> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (13).. (13) 

<223> n is a, c, g, or t 

<400> 9 

datngtytca . ttnccrtgyt g 



lie 



21 



<210> 10 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 10 

catacacgta tgggcgatta t 



21 



<210> 11 

<211> 20 

<212> DRA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 11 

gatgtgggga tgatgtcgat 



20 



<210> 12 

<211> 35 

<212> DRA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 12 

tgttctagac atatgaagat gtacaaatcg aaagg 



35 



<210> 13 
<211> 41 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<4:00> 13 

gtctcgagga tccttatttt tttaatgcat aaaacgaatc c 41 

<210> 14 

<211> 34 

<212> DNA 

<213> Artificial Sequence 

<220> , 



<223> Synthetic Oligonucleotide 
<400> 14 

gatattatcc ccaccatctg tggctttgcc ggaa 

<210> 15 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 15 

ttccggcaaa gccacagatg gtggggataa tatc 



34 



34 



<210> 16 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 16 

tctagacata tgcaaacctc aaaagtagca get 33 



<210> 17 
<211> 12 
<212> PRT 

<213> Flavobacterium heparinura 
<400> 17 

Met Gin Thr Ser Lys Val Ala Ala Ser Arg Pro Asn 
15 10 



<2X0> 18 
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<21l> 33 'i'r: 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 18 

tctagacata tgcaaacctc aaaagtagca get 33 



<210> 19 

<211> 41 

<212> DlsTA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 19 

gtctcgagga tccttatttt tttaatgcat aaaacgaatc c 41 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 20 

ccagccgctc gctacacctt cacg 



<210> 21 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 21 

cgtgaaggtg tagcgagcgg ctgg 24 



<210> 22 

<211> 10 

<212> PRT 

<213> Flavobacterium heparinum 

<400> 22 

Tyr lie Val Tyr Asp Lys Gly Glu lie Arg 
1 5 10 



<210> 23 
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<211> 13 
<212> PRT 

<213> Flavobacterium heparinum 



<400> 23 

Thr Tyr Pro Ser Val Gly Trp Asn Glu Ser Gin Trp Arg 
1 5 10 



<210> 24 
<211> 28 
<212> PRT 

<213> Flavobacterium heparinum 
<400> 24 

Lys Met Pro His Glu Thr Gly Phe Thr Gly Asn Thr Pro Glu Lys Asp 
15 10 15 

Gly Gin Trp Pro Asp Ser Val Leu Met Met Gly Lys 
20 25 



<210> 25 

<211> 31 

<212> PRT 

<213> Flavobacterium heparinum 

<400> 25 

Val Ala Gin His Gly Phe Glu Thr 
1 5 

Tyr Thr Asp Ala Val Thr Pro Ser 
20 



He Glu Asn Thr Gly Met Gly Asp 
10 15 

Gin Cys Ala Asn Phe Asn Lys 
25 30 



<210> 26 

<211> 24 

<212> PRT 

<213> Flavobacterium heparinum 

<400> 26 

Thr Asp Asp Gin Leu Val Cys Asn Gly He Asp He He Pro Thr He 
1 5 10 15 

Cys Gly Phe Ala Gly He Ala Lys 
20 



<210> 


27 


<211> 


20 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Oligonucleotide 


<220> 
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<221> mis e_f e at ur e 

<222> (9).. (9) 

<223> n is a, c, g, or t 



<400> 27 

tayathgtnt aygayaargg 



<210> 28 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mis Cofeature 
(1)..(1) 
n is a, c, g. 



Tnisc_f eatiire 
(13) ..(13) 
n is a, c, g. 



or t 



or t 



<400> 28 

nccyttrtcr tanacdatrt a 



<210> 29 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<220> 

<221> misc_feature 

<222> (9) . . (9) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (18).. (18) 

<223> n is a, c, g, or t 

<400> 29 

carcayggnt tygaracnat 



30 
21 
DNA 

Artificial Sequence 
<220> 



<210> 
<211> 
<212> 
<213> 
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<223> Synthetic Oligonucleotide 
<220> 

<221> misc_feature 

<222> (4).. (4) 

<223> n is a, C; g, or t 

<220> 

<221> misc_f eature 

<222> (13) . . (13) 

<223> n is a, c, g, or t 

<400> 30 

datngtytca ttnccrtgyt g 21 



<210> 31 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<220> 

< 2 2 1 > mi B c_f ea t ure 

<222> (9) . . (9) 

<223> n is a, c, g, or t 

<400> 31 

tayathgtnt aygayaargg 



<210> 32 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(1) (1) 
n is a> c, g, 

mis cofeature 
(10) .. (10) 
n is a, c, g, 



or t 



or t 



<400> 32 

nccyttrtan acdatrta 



<210> 33 

<211> 20 

<312> DNA 

<213> Artificial 



Sequence 
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<:220> 

<223> Synthetic Oligonucleotide 
<220> 

<221> tnisc_feature 

<222> (I5> . . (15) 

<223> n is a, c, or t 

<220> 

<221> niisc_f eature 

<222> (18) . . (18) 

<223> n is a, c, g, or t 

<400> 33 

athgayatha thccnacnat 



20 



<210> 
<211> 
<212> 



34 
21 

DNTA 



<213> Artificial Sequence 



<220> 



<223> Synthetic Oligonucleotide 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mis c_f eature 
(4) .. (4) 

n is a, c, g, or t 



niisc_feature 
(7) (7) 

n is a, c, g, or t 



<400> 34 

datngtnggd atdatrtcda t 



21 



<210> 
<211> 
<212> 
<213> 

<220> 



35 
23 
DNA 

Artificial Sequence 



<223> Synthetic Oligonucleotide 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



rnisc_feature 
(12) . . (12) 
nis a, c, g, ort 



mis cofeature 
(15) . . (15) 
n is a, c, g, or t 



<400> 35 
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<210> 


36 


<211> 


23 


<212> 


DNA 


<213> 


T^tI" 1 f^H pt a 1 flpcnieTice 

JnJL, -L. ^ ^ ^ JL JL >J ^ \A, v.r ,lX\tr\.f 


<220> 




<223> 


Synthetic Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(9) (9) 


<223> 


n is a, c, g, or t 


<220> 




<221> 


mi s c_f e a tur e 


<222> 


(12) . . (12) 


<223> 


n is a, c, g, or t 



<400> 36 

aarcadatng tnggdatdat rtc ' 23 



<210> 37 
<211> 15 
<212> PRT 

<213> Flavobacterium heparinum 
<400> 37 

Phe Thr Arg Ala Tyr Cys Ala Gin Pro Leu Cys Thr Pro Ser Arg 
15 10 15 



<210> 38 
<211> 1407 
<212> DNA 

<213> Flavobacterium heparinum 
<400> 38 

agtaaacata acatgaagat gtacaaatcg aaaggctggt tgatagccat gcttatactt 
gcaggttttg gagatgcagg ggcgcaaacc tcaaaagtag cagcttccag gcctaacatc 
attatcatca tgacagatca gcaaacagct gatgccatga gcaatgctgg taataaggac 
ctgcatacac ctgcaatgga tgttttggct gcaaacggta cccgttttac acgtgcctat 
fcgtgcccagc cgctctgtac accttcacgc tccgcgatat ttagcggaaa aatgccacat 
gaaaccggct ttacggggaa tacaccggaa aaggacggac agtggcccga ttctgtgctg 
atgatgggca aaatatttaa ggcaggaggc tataaaaccg gctacgtcgg aaaatggcac 
ctgcctgttc ctgttactaa agtagcacaa catggatttg agactattga gaatacaggt 
atgggcgatt ataccgatgc agttacccca tcgcaatgcg ccaacttcaa taaaaagaat 
aaagacaacc catttttact ggtagcatcc tttttgaacc cacacgatat ttgtgaatgg 
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180 
240 
300 
360 
420 
480 
540 
600 
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gcaaggggtg- 


ataatttgaa 


aatggatgtt 


ctggatgcag 


cgccggatac 


agcattttgt 


660 


ccgaaattac 


ctgccaactg 


gccaattccg 


gcttfctgagc 


ctgccattgt 


^agggaacag 


720 


caaaaggtga 


acccgcgtac 


ttatccttcg 


gtaggctgga 


acgaaagcca 


gtggcgcaaa 


780 


taccgctggg 


cctataaccg 


cctggtagag 


aaggtagaca 


attatatggc 


catggtattg 


840 


ggttcgttaa 


aaaaatatgg 


tatagaagac 


aataccatca 


tcatctttac 


cagcgatcat 


900 


ggtgatggtt 


atgcggcaca 


tgagtggaac 


cagaagcaga 


ttttgtatga 


ggaggctgcc 


960 


aggatacctt 


ttatcatctc 


gaagatcgga 


caatggaaag 


ccagaaccga 


tgatcagctg 


1020 


gtttgcaatg 


gcatcgatat 


tatccccacc 


atatgtggct 


ttgccggaat 


tgctaaacct 


1080 


gttggtttaa 


aaggcctgga 


tttaagtaaa 


cgtattgcca 


acccttcggt 


taaactacgg 


1140 


gatactttag 


tgatagaaac 


cgattttgct 


gataacgaac 


tgttgctggg 


tattaagggc 


1200 


agggcagtga 


ttaccaaaga 


ttttaaatac 


attgtttatg 


acaaggggga 


gatccgggaa 


1260 


caattgtttg 


acctggaaaa 


agacgcagga 


gaaatggata 


acctggctgt 


taaacccgcc 


1320 


tataaaaaga 


aattgaatga 


aatgcgcgct 


tacctgaaac 


tatggtgtaa 


acagcaccag 


1380 


gattcgtttt 


atgcattaaa 


aaaataa 








1407 



<210> 39 

<211> 468 

<212> PRT 

<213> Flavobacterium heparinum 

<400> 39 

Ser Lys His Asn Met Lys Met Tyr Lys Ser hys Gly Trp Leu lie Ala 
1 5 10 15 

iVIet Leu lie Leu Ala Gly Phe Gly Asp Ala Gly Ala Gin Thr Ser Lys 
20 25 30 

Val Ala Ala Ser Arg Pro Asn lie lie lie lie Met Thr Asp Gin Gin 
35 40 45 

Thr Ala Asp Ala Met Ser Asn Ala Gly Asn Lys Asp Leu His Thr Pro 
50 55 60 

Ala Met Asp Val Leu Ala Ala Asn Gly Thr Arg Phe Thr Arg Ala Tyr 
65 70 75 80 

Cys Ala Gin Pro Leu Cys Thr Pro Ser Arg Ser Ala lie Phe Ser Gly 
85 90 95 

Lys Met Pro His Glu Thr Gly Phe Thr Gly Asn Thr Pro Glu Lys Asp 
lOO 105 110 

Gly Gin Trp Pro Asp Ser Val Leu Met Met Gly Lys lie Phe Lys Ala 
115 120 125 

Gly Gly Tyr Lys Thr Gly Tyr Val Gly Lys Trp His Leu Pro Val Pro 
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130 135 140 

Val Thr Lys Val Ala Gin His Gly Phe Glu Thr lie Glu Asn Thr Gly 
145 150 155 160 

Met Gly Asp Tyr Thr Asp Ala Val Thr Pro Ser Gin Cys Ala Asn Phe 
165 170 175 

Asn Lys Lys Asn Lys Asp Asn Pro Phe Leu Leu Val Ala Ser Phe Leu 
180 185 190 

Asn Pro His Asp lie Cys Glu Trp Ala Arg Gly Asp Asn Leu Lys Met 
195 200 205 

Asp Val Leu Asp Ala Ala Pro Asp Thr Ala Phe Cys Pro Lys Leu Pro 
210 215 220 

Ala Asn Trp Pro He Pro Ala Phe Glu Pro Ala He Val Arg Glu Gin 
225 230 235 240 

Gin Lys Val Asn Pro Arg Thr Tyr Pro Ser Val Gly Trp Asn Glu Ser 
245 250 255 

Gin Trp Arg Lys Tyr Arg Trp Ala Tyr Asn Arg Leu Val Glu Lys Val 
260 265 270 

Asp Asn Tyr Met Ala Met Val Leu Gly Ser Leu Lys Lys Tyr Gly He 
275 280 285 

Glu Asp Asn Thr He He He Phe Thr Ser Asp His Gly Asp Gly Tyr 
290 295' 300 

Ala Ala His Glu Trp Asn Gin Lys Gin He Leu Tyr Glu Glu Ala Ala 
305 310 315 320 

Arg He Pro Phe He He Ser Lys He Gly Gin Trp Lys Ala Arg Thr 
325 330 335 

Asp Asp Gin Leu Val Cys Asn Gly He Asp He He Pro Thr He Cys 
340 345 350 

Gly Phe Ala Gly He Ala Lys Pro Val Gly Leu Lys Gly Leu Asp Leu 
355 360 365 

Ser Lys Arg He Ala Asn Pro Ser Val Lys Leu Arg Asp Thr Leu Val 
370 375 380 

He Glu Thr Asp Phe Ala Asp Asn Glu Leu Leu Leu Gly He Lys Gly 
385 390 395 400 

Arg Ala Val He Thr Lys Asp Phe Lys Tyr He Val Tyr Asp Lys Gly 
405 410 415 

Glu He Arg Glu Gin Leu Phe Asp Leu Glu Lys Asp Ala Gly Glu Met 
420 425 430 

Asp Asn Leu Ala Val Lys Pro Ala Tyr Lys Lys Lys Leu Asn Glu Met 
435 440 445 

Arg Ala Tyr Leu Lys Leu Trp Cys Lys Gin His Gin Asp Ser Phe Tyr 
450 455 460 
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Ala Leu Lys Lys 
465 
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